PREFACE


THE composition of this book requires some explanation.
Chapters I, II, and III of Part II originally formed a 
doctorate thesis* on “The Use of the Theory of Correlation in 
Psychology,” and were privately printed as such in August of last 
year. The other chapters of the book have, as it were, been 
written round this nucleus, with a view to producing a concise 
yet fairly complete account of the various quantitative methods 
employed in psychology. The slightly controversial nature
of the thesis is intentionally retained, since the interests of 
quantitative psychology would seem to be best served thus— 
at least until the biometric methods of Professor Karl Pearson 
and his school become more generally known among psychologists
than they are at present. Perhaps it is as well to add
that I am alone responsible for the views therein expressed.


Although written primarily for the professed psychologist, it 
is hoped that the book will also prove of use to the educationist. 
The theory of correlation is likely to be found especially valuable 
in educational psychology, if only its presuppositions are clearly 
grasped and the requisite precautions in its application exercised ; 
and now that a real training in psychology is beginning to be
regarded as an essential part of a teacher’s equipment, we may
expect valuable contributions to psychology from teachers in
the immediate future. If this book is found helpful on the
quantitative side to such research students, its existence will
be justified.

* Approved for the Degree of Doctor of Science in the University of London.


My thanks are due to the Editors of the British Journal of
Psychology, who have very kindly allowed me to re-publish an
article from that periodical.


W. B.


May, 1911. 
PSYCHO-PHYSICS


CHAPTER I
MENTAL MEASUREMENT 


The pre-conditions of measurement in any sphere of ex- 
perience are (1) the homogeneity of the phenomenon, or of any 
particular aspect of it, to be measured, (2) the possibility of 
fixing a unit in terms of which the measurement may be 
made, and of which the total magnitude may be regarded as 
a mere multiple or sub-multiple. These pre-requisites are 
satisfied in the cases of spatial and temporal magnitudes, in 
terms of which, directly or indirectly, all the measurements of 
the physical sciences are expressed. It was thought by 
Fechner that they also applied in the case of the strictly 
psychical phenomenon of sensation-intensity, i.e. it was assumed 
that any given sensation-intensity might be regarded as made 
up of a sum of unit sensation-intensities. This view has been 
definitely rejected by later psychologists. Every sensation- 
intensity is qualitatively distinct from every other sensation- 
intensity. No addition or subtraction of intensities is possible.
“To introspection, our feeling of pink is surely not a portion of
our feeling of scarlet; nor does the light of an electric arc seem 
to contain that of a tallow-candle in itself” (James)*. Fechner’s 
mistake was due to a confusion of sensation-intensities with 
the (physical) stimulus-values required to produce them. The 
latter can be measured, the former cannot. 

* Principles of Psychology, Vol. I. p. 546.

Nevertheless, purely psychical measurement is not entirely
impossible. Within any one series of sensation-intensities,
e.g. a series of greys, the contrasts or “distances” separating
different pairs of intensities are perfectly homogeneous with 
one another and can be measured in terms of one another or in 
terms of an arbitrarily chosen unit of “sense-distance.” Given 
two brightness-intensities a and b, it is quite possible to find,
within limits of error, a brightness-intensity c which is as
much higher than b in the scale of intensities as b is than a,
i.e. such that the sense-distance bc = the sense-distance ab; or,
again, it is quite possible, theoretically, to find a brightness-
intensity d which bisects the sense-distance ab, i.e. which is
such that it is as far removed from a in the scale of intensities
as b is from it; in symbols, ad = db. Hence the “distance,” or
disparity of b from a is twice that of d from a, the distance of c
from a is four times that of d from a. If, now, ad, or the
distance of d from a, be taken as a conventional unit, the
values of ab and ac will be 2 and 4 respectively*.

[Fig. 1. Scale of brightness-intensities, with the points a, d, b, c marked in ascending order.]


* This view of mental measurement in terms of sense-distance first originated
with J. R. L. Delboeuf: Revue Philosophique v. 1878, p. 58. His term for
sense-distance was “contraste sensible.”

A scale of intensities may in this way be formed rising by
“equal-appearing intervals” or sense-distances, and the magnitude
of any given interval may then, theoretically, be read off
in terms of the unit-distance employed in the construction of
the scale. In practice, however, it is found more convenient to
fix the successive scale-marks, the successive members of the
intensity-series, in terms of their corresponding stimulus-values.
It has been found by experiment that in the case of light-
intensities and sound-intensities the successive stimulus-values
form, with fair approximation, a geometric progression, or, in
other words, each stimulus-value divided by the immediately
preceding one gives approximately the same quotient. From
the stimulus-values corresponding to an ascending series of
eight equidistant brightness-values, Ebbinghaus obtained the
following series of quotients:

[series of quotients illegible in the source]

The quotient-value is not entirely constant, being slightly
greater towards the two ends of the scale than it is at or about
the middle. For this central region of the scale, then, the
general relation of the sense-distances to the stimulus-values is
given by the logarithmic formula:

$$SS_0 = k \log \frac{R}{R_0} \qquad (1)^{*},$$

where $SS_0$ represents a sense-distance, and $S_0$ is any finite
intensity of sensation taken as the starting point or conventional
zero (N.B. it is not necessarily liminal); $R$, $R_0$ are the
corresponding stimulus-values.
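The link between a geometric progression of stimulus-values and equal sense-distances can be checked numerically. The following is only a sketch of the logarithmic formula (1): the constant k, the choice of base-10 logarithms, and the stimulus-values themselves are illustrative assumptions, not Ebbinghaus's data.

```python
import math

def sense_distance(r, r0, k=1.0):
    """Sense-distance k * log(R / R_0) between stimulus-values R and R_0.

    k and the base-10 logarithm are illustrative assumptions.
    """
    return k * math.log10(r / r0)

# Hypothetical stimulus-values in geometric progression (common quotient 2).
stimuli = [10.0 * 2 ** i for i in range(5)]

# Sense-distance between each pair of neighbouring scale-marks.
steps = [sense_distance(b, a) for a, b in zip(stimuli, stimuli[1:])]
print(steps)
```

Every step comes out the same size, which is what a scale of equal-appearing intervals requires when the stimulus-values rise by a constant quotient.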


A mode of procedure which not only admits of much wider 
practical application than the above-mentioned “method of
mean gradations,” but also possesses a peculiar historical
importance, is that which is concerned with the determination 
of the stimulus-increments corresponding to just-noticeable 
increments of sensation-intensity in different parts of the 
intensity scale. Weber found, in a series of experiments chiefly 
with lifted weights, that this stimulus-increment was relatively, 
not absolutely, constant for different regions of the intensity 
scale, i.e. that the stimulus corresponding to any original 
sensation-intensity had always to be increased by a constant 
proportion to arouse a just-noticeable increment of the sensa- 
tion-intensity. If a 103 grms. wt. is just noticeably heavier 
than a 100 grms. wt., then the wt. just noticeably heavier than 
a 200 grms. wt. will be a 206 grms. wt., that just noticeably 
heavier than 300 grms. will be 309 grms., etc. 
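The arithmetic of this example can be reproduced in a few lines. The sketch below takes the constant proportion of 3/100 from the lifted-weight example; the function name is hypothetical, and exact fractions are used only to avoid rounding noise.

```python
from fractions import Fraction

def just_noticeably_heavier(standard, relative_dl=Fraction(3, 100)):
    """Weight that is just noticeably heavier than `standard`,
    assuming the stimulus must grow by a constant proportion."""
    return standard * (1 + relative_dl)

weights = [100, 200, 300]
comparisons = [just_noticeably_heavier(w) for w in weights]
print([int(c) for c in comparisons])
```

The absolute increments (3, 6, 9 grms.) differ, while the relative increment stays fixed at 3/100.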


Mathematically formulated, Weber’s Law is:

$$\frac{\Delta R}{R} = \text{constant} \qquad (2).$$

The quantity $\frac{\Delta R}{R}$ ($\frac{3}{100}$, or .03, in our example of
weight-lifting) is known as the “relative difference limen” (DL).
It is of course the average value of a considerable number of
determinations.

* Ebbinghaus.

Fechner verified Weber’s Law in many different realms of 
sensation-intensity, and made it the basis of his own system of 
mental measurement. This he did by making the following 
three assumptions: 

(1) that a sensation-intensity is a measurable magnitude 
and may therefore be regarded as a sum of unit-intensities ; 

(2) that just-noticeable differences of sensation-intensity 
are equal at different parts of the stimulus scale, and may 
therefore conveniently serve as the unit-intensities above- 
mentioned ; 

(3) that the just-noticeable difference of sensation may be 
treated as a difference of two sensations, or at least that if 
Weber’s Law applies to the former (“sensed difference”) it will
also apply to the latter (“difference sensation”).

On the basis of Weber’s Law and these added assumptions,
Fechner obtains the following formula, viz.

$$dS = c \, \frac{dR}{R} \qquad (3),$$

which he calls the fundamental formula for mental measurement.
Integrating, this becomes

$$S = c \log_e R + C.$$

Putting $R = r$, the “stimulus limen,” for which $S$ is just below
the threshold of consciousness, i.e. $= 0$, we have

$$0 = c \log_e r + C.$$

Subtracting, $S = c \log_e \frac{R}{r}$.

Putting $r = 1$, and transferring to the ordinary logarithm system,
we get

$$S = k \log R \qquad (4).$$
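Fechner's reasoning amounts to counting just-noticeable steps upward from the stimulus limen. The sketch below, under the assumption of a perfectly constant relative difference limen, counts those steps directly and compares the count with the logarithmic formula; the limen of 100, the stimulus of 200 and the fraction .03 are illustrative values only.

```python
import math

def count_jnd_steps(stimulus, limen, weber_fraction):
    """Count how many just-noticeable steps fit between limen and stimulus,
    each step multiplying the stimulus-value by (1 + weber_fraction)."""
    steps = 0
    level = limen
    while level * (1 + weber_fraction) <= stimulus:
        level *= 1 + weber_fraction
        steps += 1
    return steps

def fechner_s(stimulus, limen, weber_fraction):
    """S = c * log_e(R / r), with c chosen so that one step equals one unit."""
    c = 1 / math.log(1 + weber_fraction)
    return c * math.log(stimulus / limen)

print(count_jnd_steps(200.0, 100.0, 0.03), fechner_s(200.0, 100.0, 0.03))
```

The whole-number count is the integer part of the logarithmic value, which shows how the formula follows once every step is treated as an equal unit. Whether that treatment is legitimate is exactly what assumption (2) asserts.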

All the assumptions involved are questionable. The first 
one has already been considered at some length. It is not the 
single sensation-intensity which is measurable, but the distinct- 
ness, disparity or distance of one sensation-intensity from 
another. Any intensity taken by itself is 0; it is at a zero
distance from itself, $S_0S_0 = 0$ [see equation (1) above].
We must therefore regard the just-noticeable difference,
not as a difference of two sensation-intensities but as a minimal
sense-distance, if we are to be able to make use of it in our
scheme of mental measurement. This modification, however,
still leaves us involved in the difficulties of assumptions
(2) and (3).

Fechner’s own reason for regarding all just-noticeable 
differences belonging to any one scale of intensities as equal 
was that they appear equal to introspection. Introspection in 
a case like this is obviously difficult, even for the most skilful 
observers, and its verdict cannot, therefore, be greatly relied 
upon. Theoretically it is quite conceivable that just-noticeable 
differences, though equivalent to one another as being all just 
noticeable, i.e. as being sense-distances so small that the
slightest diminution of them would cause them all, equally, to
cease to be noticeable, yet as noticed or perceived would appear 
of different magnitude one from another. Ebbinghaus points 
to the analogous case of differentials or infinitesimals in 
mathematics. These are all equivalent to one another as 
being all equally negligible as compared with finite magnitudes, 
yet are by no means necessarily equal to one another. If they 
belong to different “orders,” those of a higher order are 
negligible as compared with those of a lower, etc. Again, “the 
least distances perceived as such at different parts of the skin 
or in direct and indirect vision do not by any means all appear 
as equal magnitudes. On the contrary, so soon as they come 
to consciousness as distances they are at once perceived as 
distances of varying size, in a certain approximation to their 
objective differences*.” In spite of these considerations, 
Ebbinghaus regards the correspondence of the stimulus results 
obtained for equal appearing intervals and for just perceptible 
intervals in the case of brightness-intensities (in the middle 
region of the scale both series of stimuli form a geometrical 
progression) as sufficient evidence for the approximate equality 
of the latter intervals. Müller and Wundt had previously
advanced the same argument.


* Ebbinghaus: Grundzüge der Psychologie, 2nd ed., 1905, p. 524.
Several experimental investigations have been made with
the express purpose of testing the relation of the methods of
minimal change and mean gradations. Titchener* sums up
the theoretical basis of such experiments concisely as follows:
“There are in reality two possible ways of working. (1) We
might take a series of R-values, corresponding to a series, say,
of eight successive j. n. d. of S, and thereafter directly compare
the two half-distances, of four j. n. d. each, and decide upon
their equality or inequality. This would be a direct method of
experiment: determine the R-differences as a-b, b-c, c-d,
d-e, e-f, f-g, g-h, h-i, and then see if a-e is equal in
S to e-i. Or (2) we might determine a few j. n. d. of S, at
different parts of the R-scale, in order to establish the constancy
of the relative DL, and thereafter work with supraliminal d.,
and decide whether the same uniformity holds. This would be
an indirect method: it is the method indicated by the authors
just cited [Müller, Wundt, Kohler, and Tannery]. Let a-b
and h-i be R-differences corresponding to the j. n. d. of S, and
let there be constancy of the relative DL, i.e., let
$\frac{b-a}{a} = \frac{i-h}{h}$.
If the j. n. d. are equal, then the point e, which lies midway
for S between a and i, must be such that $\frac{e}{a} = \frac{i}{e}$, or
$e = \sqrt{ai}$. Either of these two methods would, presumably, take
us to our goal. The experimental work would be exceedingly
difficult. Liminal determinations are always and intrinsically
difficult; and, further, the judgments passed upon j. n. d. and
upon supraliminal d. are, even under the most favourable
conditions, the expressions of radically different mental attitudes.”


* “Experimental Psychology,” Vol. 1. Instructor’s Manual, p. lxxvii.

It should be mentioned here that the principal rival to
Fechner’s hypothesis (the “difference hypothesis”) is that
first formulated by Plateau, and generally known as the
“quotient hypothesis.” Plateau adopted the psycho-physical
formula

$$S = kR^{c}$$

on the basis of experiments by the method of mean gradations.
This implies, in the place of Fechner’s fundamental
formula, the formula

$$\frac{dS}{S} = c \, \frac{dR}{R};$$

in other words, it assumes that j. n. d. are relatively, not
absolutely, equal S-magnitudes. Although Plateau himself
withdrew his formula later, the “quotient hypothesis” still
remains as the rival of Fechner’s “difference hypothesis.”


To return to the experiments. Merkel (1888) worked with
brightnesses, pressures, and noises, and found that the R-value
of the S bisecting a supraliminal sense-distance was the
arithmetic mean of the R-values of the two terminal sensations,
i.e., in Titchener’s notation, $e = \frac{a+i}{2}$, not $\sqrt{ai}$. This would
seem to support the quotient hypothesis, although such an
inference is not entirely free from objection. Angell (1892)
worked with noise-intensities by the method of mean gradations,
avoided certain sources of error present in Merkel’s form of
procedure, and obtained results supporting the difference
hypothesis, $e = \sqrt{ai}$.
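The two rival predictions for the bisecting stimulus can be set side by side. The sketch below computes both for one pair of terminal stimulus-values; the values 100 and 400 are illustrative assumptions, and the function names are hypothetical.

```python
import math

def bisection_geometric(a, i):
    """Bisecting R-value e = sqrt(a * i), the value found by Angell."""
    return math.sqrt(a * i)

def bisection_arithmetic(a, i):
    """Bisecting R-value e = (a + i) / 2, the value found by Merkel."""
    return (a + i) / 2

a, i = 100.0, 400.0
e_geometric = bisection_geometric(a, i)
e_arithmetic = bisection_arithmetic(a, i)
print(e_geometric, e_arithmetic)

# With the geometric mean the two half-distances are equal on a log scale.
lower_half = math.log(e_geometric / a)
upper_half = math.log(i / e_geometric)
```

The wider the terminal values lie apart, the further the two predictions diverge, which is why bisection experiments over large sense-distances were thought capable of deciding between the hypotheses.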

A more thorough investigation was carried out by W. Ament*
in 1900. He used a series of Marbe greys for the brightnesses
and employed the direct method; for the noise-intensities he
used a Fechner sound pendulum and worked principally by
the indirect method. The result reached was that “the DL as
a magnitude increases with increasing R,” i.e. it supported the
quotient hypothesis. Titchener† summarises Ament’s conclusions
as follows:

“The result is illustrated by the following Figure.

[Fig. 2.]

The values $r_1$, $r_2$, $r_3$, etc. are R-intensities, arranged in
ascending order: the distance between every pair of successive
r represents a DL. The distances $r_1 - r_2$, $r_2 - r_3$, etc., increase
by equal increments. It is clear (1) that the distance
$r_5 - r_9$, though it contains the same number of j. n. d., will
appear greater than the distance $r_1 - r_5$; (2) that the difference
decreases, the nearer the compared distances approach to $r_5$,
so that a comparison of $r_4 - r_5$ with $r_5 - r_6$ may even evoke
the judgment ‘equal’; and (3) that a comparison of $r_1 - r_2$
with $r_8 - r_9$ will evoke a decided judgment of difference, although
the two distances are alike j. n. d.”

* W. Ament: “Ueber das Verhältnis der ebenmerklichen zu den übermerklichen
Unterschieden bei Licht- und Schallintensitäten,” Phil. Studien, xvi.
1900, pp. 135 ff.

† Op. cit., pp. lxxxv, lxxxvi.

Ament’s work has not escaped criticism. A repetition of his
experiments on brightness-intensities by Fröbes* has failed to
confirm his results. Ebbinghaus also has found, in careful experiments
with rotating sectors, that just-noticeable differences
in different parts of the scale of brightnesses are equal to one
another. On the whole, therefore, the balance of evidence seems
to be in favour of the “difference hypothesis.”


Fechner’s third assumption—see above, p. 4—brings us to 
the question of the interpretation of Weber’s Law. 

There are three general forms of interpretation : 

(1) the psycho-physical (Fechner), 
(2) the physiological (Müller, Ebbinghaus, James, etc.),
(3) the psychological (Wundt). 

According to (1), the logarithmic transition takes place in
passing from the physiological changes in the sensory centres of 
the cerebral cortex to the corresponding sensation-intensities. 
The chief objection to this view of Fechner’s is that Weber's 
Law is not exact. This same consideration supports (2), or the 
physiological view, according to which the transition occurs either 
at the inception of the stimulus in the sense organ, or somewhere 
between the nerve endings and their central connections in the 
sensory areas of the cortex. Experimental results obtained
by Waller† and Steinach‡ are in favour of this view. Waller
stimulated a frog’s eye with light of different intensities, and
found that the corresponding “negative variations” set up in
the optic nerve varied in intensity as the logarithm of the
stimulus-values (approx.). Steinach obtained similar results
on stimulating the skin of a frog’s thigh with weights and
noting the negative variation in the attached nerve. If the
negative variation may be assumed to be proportional to the
intensity of the nerve-current passing along the nerve, these
results point to the conclusion that the logarithmic transition
occurs in the sense organ and its sensory nerve endings.

* Zeitschrift für Psychologie, xxxvi. p. 344.

† Waller: “Points relating to the Weber-Fechner Law,” Brain, Vol. xviii.
p. 200, 1895.

‡ Steinach: “Elektromotorische Erscheinungen an Hautsinnesnerven bei
adäquater Reizung,” Pflüger’s Arch. Bd. 63, S. 495, 1896.


The effect would thus be analogous to that of friction and 
inertia in machinery. Ebbinghaus* has constructed a theory 
based upon the conception of varying degrees of dissociability of 
complex molecules to account for the law and also for the devia- 
tions from it towards the two extremes of the intensity scale. 


The psychological view, (3), of the law, held by Wundt, 
regards it as a special case of the general psychological “law 
of relativity.” Stimulus, physiological process, and pure sensa- 
tion-intensity increase in simple proportion to one another. 
The logarithmic transition occurs in passing from mere sensation 
and sensation difference to apperceived sensation and apper- 
ceived sensation difference, and the intensities are apperceived 
always in relation to one another. In addition to the 
objection that it regards the sensation-intensities themselves, 
not their distances from one another, as measurable magnitudes,
this view is also open to the criticism that it furnishes no explanation
of the widely varying size of the relative difference limen
in different sense-departments (e.g. DL for brightness-intensities
= 1/100, DL for sound-intensities = about 1/3); the physiological
view sees in the varying structure of the different sense-organs 
the adequate explanation of this. Moreover, the psychological 
view has no completely satisfactory explanation to give of the 
deviations from Weber’s Law so frequently met with. 


* Ebbinghaus: “Über den Grund der Abweichungen von dem Weber’schen
Gesetz bei Lichtempfindungen,” Pflüger’s Arch. Bd. 54, S. 118, 1889.

These objections and difficulties make the view improbable,
but by no means prove it to be impossible. It has the great
merit of emphasizing more definitely than was heretofore the
case the importance of the more purely psychological factors
in psycho-physical experiments; in particular, it brings into
prominence the distinction between mere disparity of sensation-
intensities present simultaneously or in immediate succession
in the same consciousness and the perception of this disparity,
the discrimination of the intensities one from the other. In
psycho-physical experiments the subject’s consciousness is not
limited to the mere sensational level.


Although, with continuous increase of stimulus-intensities
the corresponding sensation-intensities rise in “steps,” each
representing a just-noticeable difference, the psycho-physical
relation is really a strictly continuous one. This becomes at
once obvious if we consider a special case. The sensation-
intensity aroused in lifting a wt. of 100 grms. is indistinguishable
from that aroused by 102 grms., the sensation aroused by
102 grms. is indistinguishable from that aroused by 104 grms.;
yet the sensation-intensity aroused by 100 grms. is perceptibly
less than that aroused by 104 grms. Thus the sensation-
intensity increases continuously, and the reason why this is
not immediately apparent is probably to be looked for in
the physiological mechanism of the psycho-physical organism.
Delboeuf held that the limen has no psychological importance
whatever. If this is an extreme view, the importance which
Fechner attributed to the limen is equally extreme in the
other direction, and farther removed from the truth.
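The special case of the three weights can be stated as a small check. The sketch below assumes a relative difference limen of 3/100, borrowed from the earlier lifted-weight example; the function name is hypothetical.

```python
from fractions import Fraction

def noticeably_different(w1, w2, relative_dl=Fraction(3, 100)):
    """True when the heavier weight exceeds the lighter by at least the
    assumed relative difference limen."""
    lighter, heavier = sorted((w1, w2))
    return Fraction(heavier - lighter, lighter) >= relative_dl

pairs = [(100, 102), (102, 104), (100, 104)]
verdicts = [noticeably_different(a, b) for a, b in pairs]
print(verdicts)
```

Indistinguishability is therefore not transitive: two unnoticed steps add up to a noticed one, which is only possible if the underlying sensation-intensity has risen at each step.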


The absolute or stimulus limen, RL, is similar in kind to
the difference limen, since consciousness is never empty of
sensation-intensities when such a limen is being determined. 
Here again, Fechner’s rigid distinction of the two was a funda- 
mental error. 


Indirect Methods of Measurement. 

The preceding account has probably sufficed to show that 
purely psychical measurement is a possibility. Its practical 
application, however, has been more detailed than extensive. A 
more generally useful method in quantitative psychology has 
been found to be that which measures the external, physical or 
physiological, causes and effects of mental process. The measurements
are made in terms of the physical units of space and time,
yet they are not merely physical measurements, since they derive
all their significance from the correlated psychical processes.
They are indirect psychical measurements*. Measurements of
reaction-times, memory, fatigue, illusions, etc. are all of this 
nature. Their varieties are innumerable, and are illustrated by 
the accounts in any good text-book of experimental psychology 
(Sanford, Titchener, Myers). In all cases full introspective 
accounts are essential, and when correlated with the measure- 
ments make the latter essentially psychical measurements. 
Measurements of limina, referred to in the previous section, 
are of the same nature. They are of some special importance 
as being measures of sensory acuity, etc.—aspects of the total 
mental ability of psycho-physical organisms. They figure 
prominently in many researches based on the use of “mental 
tests.” 

* See Ebbinghaus, Grundzüge der Psychologie, 1905, pp. 75, 76.

A method which makes a partial return to the more purely
psychical form of measurement in terms of “distance” is the
method of ranks or grades. Suppose we are considering the
relative abilities of, say, 100 boys in English Composition. We
should find it difficult to mark their essays individually in
terms of any constant unit but might find it possible to arrange
them in order of merit, especially if we had sufficient time at
our disposal to employ the method of “paired comparisons.”
According to the procedure of this latter method, the essays
would be taken in pairs, quite at random, and the better essay
of each pair would be given a “preference mark.” This
procedure would be repeated again and again until every essay
had been compared with every other essay. The order of
merit is then given by the number of preferences attaching to
each essay. In this order, however, we cannot assume that the
“ability-distance” from one boy to the next is a constant
quantity. The boys near the extreme ends of the series will
be farther removed from one another than the boys near the
middle. We could only adjust for this if we knew the law of
frequency-distribution for this kind of ability in this particular
species of boy, and theoretically the determination of this
distribution depends upon a prior fixing of the psychological
unit, the unit of “ability-distance”; so that strictly speaking
the problem is insoluble. Since, however, under certain
definite and indefinite conditions, the form of distribution in a
large number of cases of physical measurement has been found
to be Gaussian or “normal,” we might, with some probability
of being near the truth, assume this form of distribution in
the given case and so obtain a quantitative measure for the
ability of each particular boy. A direct psychological determination
and application of the (conventional) unit-distance is not,
perhaps, an entirely impossible problem, and work in this
direction may be expected in the future. The method, if
achieved, would certainly be much more scientific and psychological
than the present one which measures in terms of external
quantum of work done.
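The procedure of paired comparisons, followed by the assumption of a normal distribution, may be sketched as follows. Everything specific here is an illustrative assumption: the five essays and their hidden "quality" figures, the judge who always prefers the higher quality, the exhaustive rather than random ordering of the pairs, and the mid-point rule used to turn a rank into a percentile.

```python
from itertools import combinations
from statistics import NormalDist

def preference_marks(essays, better):
    """Compare every essay with every other and count preference marks."""
    marks = {essay: 0 for essay in essays}
    for first, second in combinations(essays, 2):
        marks[better(first, second)] += 1
    return marks

def normal_scores(order_of_merit):
    """Place ranks (worst first) on a unit normal scale, assuming
    the ability is normally distributed (mid-point percentile rule)."""
    n = len(order_of_merit)
    unit_normal = NormalDist()
    return {essay: unit_normal.inv_cdf((rank + 0.5) / n)
            for rank, essay in enumerate(order_of_merit)}

# Hypothetical judge: prefers the essay of higher hidden quality.
quality = {"A": 3, "B": 9, "C": 5, "D": 7, "E": 1}
marks = preference_marks(list(quality),
                         lambda x, y: x if quality[x] > quality[y] else y)
order = sorted(marks, key=marks.get)   # order of merit, worst to best
scores = normal_scores(order)
print(order, scores)
```

On the normal scale the gap between the best two essays is wider than the gap between the two nearest the middle, which is the unequal "ability-distance" described above.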


Finally, the interrelations of different mental abilities 
within any well-defined group of individuals situated within 
any definite environment may be determined by means of 
the technical method of “correlation.” A correlation coefficient 
or other constant (e.g. correlation ratio) measures the tendency 
towards concomitant variation of two mental (or other) abilities 
within a group of individuals. The result may be transferred 
to any single individual within the group as measuring the 
degree of probability of connection or the closeness of con- 
nection of the two abilities in the particular case. The 
correlation between the two abilities may be due to an actual 
direct relation of the abilities to one another, or, indirectly, to 
the influence of a common external environment upon them 
both. The first of these two cases is perhaps the more
important, but the possibility of the second should not be lost 
sight of, and it also has a special interest of its own. The 
problem of correlation will be considered more fully in a later 
chapter. 
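As a preliminary illustration of concomitant variation, a correlation coefficient can be computed for two sets of marks. The sketch assumes the product-moment form of the coefficient associated with Pearson's school; the marks themselves are invented for the purpose.

```python
import math

def correlation_coefficient(xs, ys):
    """Product-moment correlation between two equally long lists of scores."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    dx = [x - mean_x for x in xs]
    dy = [y - mean_y for y in ys]
    numerator = sum(a * b for a, b in zip(dx, dy))
    denominator = math.sqrt(sum(a * a for a in dx) * sum(b * b for b in dy))
    return numerator / denominator

# Hypothetical marks of five boys in two abilities.
memory = [12, 15, 9, 20, 17]
arithmetic = [30, 34, 25, 41, 33]
r = correlation_coefficient(memory, arithmetic)
print(r)
```

A value near +1 indicates that boys high in one ability tend to be high in the other; it does not by itself say whether the connection is direct or due to a common environment.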


CHAPTER II 
THE PSYCHO-PHYSICAL METHODS 


The experimental determination of absolute and difference 
thresholds or limina is complicated and difficult. A considerable 
number of physical and psychological factors are involved, of 
varying relative importance in different cases. The result is 
that different methods of procedure have been found most 
suitable for different cases. These methods have been 
traditionally grouped under three (or four) distinct headings, 
and called the Psycho-physical Methods. They are: 

(1) the Method of Limits (Method of Minimal Changes), 

(2) the Method of Average Error (Method of Production), 

(3) the Constant Method (Method of Right and Wrong
Cases).

A fourth method is generally added to the list, viz.:

(4) the Method of Equal-Appearing Intervals, or Method
of Mean Gradations, but this is really no new method. It
owes its special name to the nature of the task which it fulfils,
viz. the determination of equal-appearing (übermerklich) sense-
distances as distinguished from just perceptible (ebenmerklich)
sense-distances. The method which it employs falls under one
or other of the first three headings.


1. The Method of Limits. 
In using this method for the determination of difference 
limina the following mode of procedure is adopted. The 
variable stimulus, V, is first made equal to, or slightly larger than,
the standard, S, and then gradually increased by small increments
until the subject finds it just noticeably greater than S. V is
increased still more, and then gradually diminished until it
just ceases to appear greater than S. The mean of the two
values of V - S thus obtained is the upper difference limen, $D_u$.
Four values, instead of two, might also be obtained, viz. for
(i) V just not perceptibly greater than S, (ii) V just perceptibly
greater than S, (iii) V just perceptibly greater than S, (iv) V
just not perceptibly greater than S, (i) and (ii) being for
ascending values of V, (iii) and (iv) for descending. $D_u$ will be
the mean of these four values of V - S.

The lower difference limen, $D_l$, is obtained in a similar way.
Both limina may be obtained in the same series of experiments
by the “method of complete descent and ascent.”
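The ascending and descending runs for the upper limen can be imitated with a simulated subject. The sketch below is only that: the observer is a hypothetical, perfectly consistent one who reports "greater" whenever V exceeds S by 3 grms. or more, and the starting values and unit step are illustrative.

```python
def ascending_limit(standard, appears_greater, start, step=1):
    """Raise V from `start` until it is first judged greater than S."""
    v = start
    while not appears_greater(v, standard):
        v += step
    return v

def descending_limit(standard, appears_greater, start, step=1):
    """Lower V from `start` until it just ceases to appear greater than S."""
    v = start
    while appears_greater(v, standard):
        v -= step
    return v

def upper_difference_limen(standard, appears_greater, low_start, high_start):
    """Mean of the two values of V - S from the ascending and descending runs."""
    up = ascending_limit(standard, appears_greater, low_start)
    down = descending_limit(standard, appears_greater, high_start)
    return ((up - standard) + (down - standard)) / 2

# Hypothetical observer with a fixed threshold of 3 grms.
observer = lambda v, s: v - s >= 3
d_u = upper_difference_limen(100, observer, low_start=100, high_start=110)
print(d_u)
```

With a real subject the two runs would differ from trial to trial, which is why a series of determinations is made and averaged.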


A series of determinations of each limen is made and the 
average taken. The reliability of this average measure is 
conveniently given by the “mean variation” (M.V.), which is 
the average of the deviations of the individual determinations 
(these deviations all being reckoned as positive) from their 
average. If there are n individual determinations $x_1, x_2, \ldots, x_n$,
with average $\bar{x}$, then

$$\text{M.V.} = \frac{(\bar{x} \sim x_1) + (\bar{x} \sim x_2) + \ldots + (\bar{x} \sim x_n)}{n}.$$

M.V. thus measures the “scatter” of the individual measures
about their mean. Other measures of “scatter” are the “measure
of precision” (h) (inverse), the standard deviation (σ) and the
“probable error” (P.E.). The general question of the measure
of “scatter” will be considered later on in this book.
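The mean variation is easily computed from a series of limen determinations. The four determinations below are invented for illustration, and the function name is hypothetical.

```python
def mean_variation(determinations):
    """Average of the deviations from the mean, all reckoned as positive."""
    n = len(determinations)
    average = sum(determinations) / n
    return sum(abs(average - x) for x in determinations) / n

# Hypothetical determinations of a difference limen, in grms.
limen_values = [3.0, 2.5, 3.5, 3.0]
limen = sum(limen_values) / len(limen_values)
mv = mean_variation(limen_values)
print(limen, mv)
```

A small M.V. relative to the limen indicates that the separate determinations agree closely with one another.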

The number and size of the increments employed must be 
adjusted to the particular conditions of the experiment. The 
subject of the experiment should be given a certain amount of 
preliminary practice before being started upon the work, and 
introspective reports should be asked of him. 

Among the various possible sources of error which deflect 
the subject’s judgment, two are of special importance. They 
arise from the temporal and spatial arrangement of the compared
stimuli, and are called the “time error” and “space
error” respectively. Thus, in a determination of the difference 
limen for sound-intensities, the two stimuli, S and V, cannot 
be presented simultaneously. One must precede the other, 
and in this way it may produce a slight degree of fatigue 
which causes an over-estimation of the other, or it may produce 
the reverse effect of sharpening the attention to the second. 
Thus a time error arises. Again, in experiments with brightness- 
intensities or visual extents, where the stimuli can be presented 
simultaneously, a space error arises from the fact that the one 
stimulus must be presented either to the right or to the left of 
the other stimulus and the subject’s judgment varies accordingly. 
In experiments with lifted weights both sources of error may 
be involved. These so-called constant errors may be ap- 
proximately neutralised by arranging that in the course of 
the experiment the standard shall precede the variable or stand 
to the right of the variable in half the cases and follow or 
stand to the left in the other half—the time or space order, or 
both, being of course changed quite at random in the successive 
limen-determinations. A better plan is to evaluate the limen
(and its M.V.) for each time and space order separately. This
gives us the values of the time error and space error, which are 
of interest for their own sakes. Fechner’s theory of these errors 
and their measurement is based upon the assumption that the 
time and space order of the two stimuli, S and V, exert an 
influence upon the result which is equivalent to a definite increase
or diminution in the stimulus-value of one or the other,
and thus increase or diminish the value of S ~ V by the
amount of the time error, p, or that of the space error, q, or
that of both together, p + q, which = c, the constant total error.
The time error, p, is positive or negative according as the effect
of the time order is to increase or diminish the apparent value
of the first-presented stimulus. The space error, q, is positive
or negative according as the effect of the space order is to 
increase or diminish the apparent value of the left-hand 
stimulus. 

Four principal cases of time and space order are possible,
and are conventionally numbered as follows:

I. Standard presented first and to the right,
II. Standard presented second and to the right,
III. Standard presented first and to the left,
IV. Standard presented second and to the left.

Employing these numbers as suffixes, we have the equations* 
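$$
\begin{aligned}
2D_u &= D_{u\,\mathrm{I}} + D_{u\,\mathrm{IV}} = D_{u\,\mathrm{II}} + D_{u\,\mathrm{III}},\\
2D_l &= D_{l\,\mathrm{I}} + D_{l\,\mathrm{IV}} = D_{l\,\mathrm{II}} + D_{l\,\mathrm{III}},\\
2(q - p) &= D_{l\,\mathrm{I}} - D_{l\,\mathrm{IV}} = D_{u\,\mathrm{IV}} - D_{u\,\mathrm{I}},\\
2c = 2(p + q) &= D_{l\,\mathrm{II}} - D_{l\,\mathrm{III}} = D_{u\,\mathrm{III}} - D_{u\,\mathrm{II}}.
\end{aligned}
$$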


In the case where a difference of time order alone is 
possible, the above equations reduce to 


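$$
\begin{aligned}
2D_u &= D_{u\,\mathrm{I}} + D_{u\,\mathrm{II}},\\
2D_l &= D_{l\,\mathrm{I}} + D_{l\,\mathrm{II}},\\
2p &= D_{u\,\mathrm{I}} - D_{u\,\mathrm{II}} = D_{l\,\mathrm{II}} - D_{l\,\mathrm{I}}.
\end{aligned}
$$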
Corresponding equations are valid for the space error q, in 
a case where a difference of space order alone is present. 
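
By way of illustration only, the equations for the case of time order alone may be put into a few lines of Python. The function name and the numerical limens are hypothetical, chosen merely so that the arithmetic can be followed; they are not taken from any experiment.

```python
def time_order_analysis(d_u1, d_u2, d_l1, d_l2):
    """Combine limens found under time order I (standard first) and
    time order II (standard second).

    Returns (D_u, D_l, p_from_upper, p_from_lower), following
        2*D_u = D_uI + D_uII
        2*D_l = D_lI + D_lII
        2*p   = D_uI - D_uII = D_lII - D_lI
    """
    d_u = (d_u1 + d_u2) / 2
    d_l = (d_l1 + d_l2) / 2
    p_from_upper = (d_u1 - d_u2) / 2
    p_from_lower = (d_l2 - d_l1) / 2
    return d_u, d_l, p_from_upper, p_from_lower


# Hypothetical limens (arbitrary stimulus units), for illustration only.
result = time_order_analysis(d_u1=4.5, d_u2=3.5, d_l1=2.5, d_l2=3.5)
print(result)
```

In actual data the two estimates of p will seldom agree exactly; the size of their disagreement gives some indication of how far the assumption of a simple additive time error is borne out.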


The errors due to expectation, habituation, fatigue, etc., are 
neutralised or at least reduced to a minimum by determining 
the limen by means of both ascending and descending values 
of V and averaging, and by other special precautions in applying 
the method. 


A modification of the method which effects a more complete
elimination of the expectation error by the use of occasional
“catch experiments,” where the variable given is equal to the
standard, is that known as the Method of Serial Groups†.


Each fixed value of V is presented with the standard ten 
times, not in immediate succession but interspersed at random 
among ten other values of V equal to S. The percentage of 
correct answers given by the subject is noted, and the experi- 
menter passes on to the next value of V which is presented along 
with catch stimuli in a similar manner. The value of V which,
as presented in this way, gives 80% right answers is arbitrarily
chosen as measuring the limen. This method combines with
the principle of the method of limits that of the constant
method (yet to be described) and is a very convenient one to
use in measuring a large number of subjects in “mental test”
experiments, where economisation of time is essential.

* G. E. Müller: Die Gesichtspunkte und die Tatsachen der psychophysischen
Methodik, 1904, pp. 67, 71.
† See C. S. Myers: Textbook of Experimental Psychology, p. 209.


2. The Method of Average Error. 

In this method the subject is required to adjust a variable 
stimulus, v, so that it seems subjectively equal to a given 
standard stimulus, s. The experiment is repeated a large 
number of times (at least 100) and the arithmetical mean, $v_m$,
of all the obtained stimulus-values is calculated. The difference
between this mean value and the standard stimulus is known
as the crude constant error, e. It may be either positive or
negative. The mean variation,

$$\text{M.V.} = \frac{S\,(v_m \sim v)}{n},$$

where $v_m \sim v$ denotes the difference taken without regard to sign, is also
calculated.
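
The two quantities just defined may be computed as in the following sketch. The settings are invented for illustration, the function name is hypothetical, and the crude constant error is taken here as mean minus standard, which is an assumption as to sign.

```python
def average_error_summary(settings, standard):
    """Return (v_m, crude constant error, M.V.) for one series of adjustments."""
    n = len(settings)
    v_m = sum(settings) / n                      # arithmetical mean of the settings
    crude_constant_error = v_m - standard        # assumed sign: mean minus standard
    mean_variation = sum(abs(v_m - v) for v in settings) / n
    return v_m, crude_constant_error, mean_variation


# Hypothetical settings of a variable line (mm) against a standard of 100 mm.
settings = [101.0, 99.0, 102.0, 98.0, 100.5, 99.5, 103.0, 101.0]
v_m, e, mv = average_error_summary(settings, standard=100.0)
print(v_m, e, mv)
```

A real series would of course contain a hundred or more settings; eight are used here only to keep the sums visible.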

The crude constant error may be partly due to a space error 
(the time error cannot occur in this method), partly to other 
constant conditions*. Let us assume that the experiment is 
to adjust the length of a variable line until it seems to be 
equal in length to a standard line. In this case, the standard 
should be situated to the right in one-half the number of 
adjustments and to the left in the other half, either alternately 
or in haphazard order. The results are tabulated in two 
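columns, I standard to right, II standard to left, and $v_{m\mathrm{I}}$, $v_{m\mathrm{II}}$
are calculated from these two columns separately. Then, if q be
the required space error and k the residual constant error, we
have

$$v_{m\mathrm{I}} - s = -q + k, \qquad v_{m\mathrm{II}} - s = +q + k,$$

i.e.

$$q = \frac{v_{m\mathrm{II}} - v_{m\mathrm{I}}}{2}, \qquad k = \frac{v_{m\mathrm{I}} + v_{m\mathrm{II}}}{2} - s.$$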
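
The separation of the space error from the residual constant error can be sketched as follows. The two column means are hypothetical figures and the function name is introduced here for illustration only.

```python
def space_and_residual_error(v_m_right, v_m_left, standard):
    """Split the crude constant error into space error q and residual error k.

    v_m_right: mean setting with the standard to the right (column I)
    v_m_left:  mean setting with the standard to the left  (column II)
    """
    q = (v_m_left - v_m_right) / 2
    k = (v_m_right + v_m_left) / 2 - standard
    return q, k


# Hypothetical column means (mm) for a standard of 100 mm.
q, k = space_and_residual_error(v_m_right=99.0, v_m_left=102.0, standard=100.0)
print(q, k)
```

With these figures column I gives 99 − 100 = −q + k and column II gives 102 − 100 = +q + k, so that both defining equations are satisfied by the values returned.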
In order to give as much definiteness as possible to the 
task of adjustment, the variable should start sometimes shorter 
than the standard, sometimes (an equal number) longer, and 
the adjustment be made by lengthening and shortening respectively.
Again, the requisite amount of lengthening or
shortening should be arranged to be different on different
occasions, but alternating with some degree of uniformity.

* Cf. E. B. Titchener, Experimental Psychology, Vol. II. Student’s Manual,
p. 74. “A ‘constant’ error is simply an error whose conditions are constant;
its amount may vary, quite considerably, from stage to stage of a long series of
experiments.”


The value of M.V. (mean variable error) obtained in this
method is from a general point of view a more important result 
than the value of the constant errors, since it has often been 
regarded as proportional to the value of the difference threshold 
as determined by the other two psycho-physical methods. The 
truth is that although there is a certain amount of proportionality 
between the values, this proportionality is not complete. The 
correlation of mean variable errors and corresponding just 
noticeable differences is less than unity. Under certain con- 
ditions the two values vary in opposite directions. It is hardly 
necessary to point out that the M.V. obtained in the Method of
Limits has but little relation to the mean variable error obtained 
by the Average Error Method. They are not entirely unrelated, 
however. 


The closest correspondence of any is that between the 
“frequency distribution” of the errors in the present method, 
and that of the judgments “equal” obtained by the Constant 
Method. 


3. The Constant Method. 


This is generally considered to be the most satisfactory of the 
psycho-physical methods. It can be employed with equal conveni- 
ence for the determination of absolute thresholds, difference thres- 
holds, equally-appearing sense-distances, and other measurements 
of psychological importance. The different values of the variable 
stimulus to be employed are fixed once for all at the beginning 
of the investigation, and are presented to the subject a large 
number of times (say 100 applications of each variable stimulus 
value) in irregular order, or in a prearranged order, unknown to 
the subject, corresponding to certain precautions to be mentioned 
later. If an absolute threshold is being determined, the variable
is presented alone; if a difference threshold, it is on each occasion
preceded, accompanied or followed by the standard stimulus.
In the latter case, the subject returns the replies, “greater,”
“uncertain” or “equal,” “less,” with reference either to the
standard, or to the variable, or on some other prearranged
principle. The percentage of each of these three types of
answers is determined for each value of the variable employed,
and recorded. The required threshold is taken as that corresponding
to the variable which evokes 50% objectively correct
answers, and consequently also 50% of incorrect and “uncertain”
(or “equal”) answers.


Taking as an example of the determination of an absolute 
threshold that of the “spatial threshold,” we may say that it is 
given by 50% two-point judgments; and as an illustration of
the detailed working of the method, we shall find it convenient
to refer to a series of results obtained by Riecker (Zeitschr. f.
Biologie, Bd. 10) which has already served in the descriptions
of the method given by Müller, Titchener, and Myers. Riecker
obtained the following results in an investigation of the 
“spatial threshold” or threshold of two-point judgments of the 
skin of the lower eyelid: 


D (distance between the points of the aesthesometer in Paris lines*) | 0 | 0.5 | 1 | 1.5 | 2 | 3 | 4 | 5 | 6

Z (% of two-pt. judgments) | 30 | 10 | 14 | 40 | 65 | 80 | 87 | 96 | 100

Differences of successive percentages: (−20) (4) (26) (25) (15) (7) (9) (4)


It will be observed that, with two exceptions, the series of 
percentages follows a general law of increase, the rate of 
increase itself increasing at first and then diminishing. The 
numbers within the brackets are formed by subtracting each 
percentage from the immediately succeeding one, and show 
this uniformity more clearly. The two exceptions are at D=0 
and at D=5. At D=0 the percentage is greater than at the 
immediately succeeding interval. This irregularity is known 
as an inversion of the first order. The cause of it is doubtless 
to be looked for in the exceptional way in which the stimulus 
(one point) may have been applied, the pressure or general 
nature of the contact may have been different from what they 
were with two-point contact, or some misleading suggestion 
may have accompanied these particular experiments. 


* A Paris line = 2.25 mm.
At D=5, the percentage is indeed larger than that for D = 4
and smaller than that for D=6, but reference to the bracketed 
numbers shows that the increase from 4 to 5 is greater than 
the increase from 3 to 4, thus breaking the general rule as 
regards rate of increase. This irregularity is known as an 
inversion of the second order, and being in the present case 
slight is probably to be explained as due to an insufficiency in 
the number of applications of the stimulus. 
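
The bracketed differences, and the two kinds of inversion just described, can be checked mechanically. The sketch below is a simplification and not a procedure taken from the authorities cited: it flags a first-order inversion wherever a percentage falls, and a second-order inversion wherever, after the largest difference has been passed, a difference exceeds the one before it. Irregularities on the rising side are not examined, and the unequal spacing of the distances is ignored, as it is in the text. The distances are those implied by Riecker's series (half-line steps up to 2 lines, whole lines thereafter).

```python
D = [0, 0.5, 1, 1.5, 2, 3, 4, 5, 6]          # Paris lines
Z = [30, 10, 14, 40, 65, 80, 87, 96, 100]    # % two-point judgments


def successive_differences(percentages):
    """Each percentage subtracted from the immediately succeeding one."""
    return [b - a for a, b in zip(percentages, percentages[1:])]


def find_inversions(distances, percentages):
    diffs = successive_differences(percentages)
    # First order: the percentage falls; reported at the lower distance.
    first_order = [distances[i] for i, d in enumerate(diffs) if d < 0]
    # Second order (simplified rule): after the largest difference, a difference
    # larger than its predecessor; reported at the upper distance of the step.
    peak = diffs.index(max(diffs))
    second_order = [distances[i + 1]
                    for i in range(peak + 1, len(diffs))
                    if diffs[i] > diffs[i - 1]]
    return diffs, first_order, second_order


diffs, first_order, second_order = find_inversions(D, Z)
print(diffs, first_order, second_order)
```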


It is important to realise clearly the fact, which G. E. Müller*
was the first to point out and emphasize, that a limen is a
variable magnitude following a certain law of frequency-distribution†.
There is no fixed limen, only an average limen,
a most frequent limen or a most representative limen. The
percentages in Riecker’s table represent the relative frequencies
of limina for distances between the compass-points less than
the corresponding D’s. Thus there are 65% limina less than
2 Paris lines, 80% limina less than 3 lines; i.e. there are
80 − 65, or 15% limina between the limits of stimulus values
2 lines and 3 lines.


This suggests the plan of plotting a “frequency-polygon”
for the limina. For the present case (omitting the anomalous
percentage value for 0) the frequency-polygon is given by the
accompanying figure (Fig. 3).


Fig. 3.


* This, at least, is the view to which Müller himself inclines (op. cit. p. 59),
but he deduces his formulae on the assumption, supported by Fechner and
Bruns, that the threshold has a single definite value, D, subject in the course of
an experimental determination to variable apparent increase or decrease by
random influences, ±δ, which obey the Gaussian “Law.” He points out that
both views lead to the same formulae.

† The term “frequency-distribution” is here used in a loose sense. Its
technical significance involves the conception of “random samples” taken from

The area of each rectangle represents the percentage or
relative frequency of the limina lying between the stimulus- 
limits indicated by the abscissae of the feet of the two perpen- 
diculars forming its vertical sides. Thus the area of abcd is 15. 
The irregularity of the frequency-polygon or “curve” is probably 
due to the relatively small number of observations recorded. 
Taking the limit of an exceeding large number of (hypothetical) 
observations or cases we might have obtained one or other of 
the two general types of frequency-curve depicted in Fig. 4: 

Fig. 4. (Frequency-curves A and B; in B the ordinates PM, RN and ST are marked.)
A is a symmetrical curve, B is known as a skew curve. In 
both cases the distribution of cases (spatial limina in the 
present instance) is represented by the area enclosed by the 
curved line and the horizontal line or axis of abscissae. 


There are three ordinates of a frequency-curve which are 
of special interest. They are: 


1. the maximum ordinate, whose abscissa is the mode;

2. the centroid vertical, the ordinate passing through the
“centre of gravity” of the surface, whose abscissa is the mean;

3. the ordinate dividing the surface into two equal areas,
whose abscissa is the median.


a “total population,” and is explained more fully in the last section of this
chapter. The description on pp. 21, 22 really belongs to this latter account, but
is inserted here because necessary for a proper understanding of what follows.

In symmetrical curves, these three ordinates coincide. In
asymmetrical or skew* curves they do not. Thus, in Fig. 4, B,
PM is the ordinate corresponding to the mode, RN that
corresponding to the median, and ST that corresponding to
the mean. The abscissa of any one of these three ordinates
would give us a definite representative liminal value. In
fixing upon 50% two-point judgments as the measure we
choose the median. Where interpolation is justifiable, i.e. in
results where the number of limina determined is sufficiently
large, so that a continuous curve can be found to “fit” the
results with sufficient closeness, the mode of this curve is the
best measure since its abscissa gives the most probable or most
frequent limen.


The “degree of scatter” of the individual limina about the
modal, mean, or median limen can be measured either by the
“standard deviation,” σ, or by the “mean variation,” M.V.
Restricting our attention for the moment to the symmetrical
curve, where mode, mean and median coincide, we have as the
value of σ, the square-root of the mean of the squares of the
deviations of the individual values from the mean (mode or
median) of all the values; or in symbols,

$$\sigma = \sqrt{\frac{S(fx^2)}{N}},$$

where N is the total frequency (total number of limina), x is the
deviation of any individual value from the mean, f the frequency
of each group of equal values and S is the symbol for summation
(i.e. $S(fx^2) = f_1x_1^2 + f_2x_2^2 + \dots + f_nx_n^2$). The mean variation
has already been explained (p. 14). Another measure, called the
“probable error,” P.E., is in common use. It = .67449 σ, and
when employed in a general way, i.e. for all kinds of skew and
all kinds of symmetrical curves, may be regarded as merely a
conventional reduction of the standard deviation. It only has
a peculiar significance of its own in the case of the Gaussian or
“normal” curve, in which it corresponds to the ordinate which
divides either half of this symmetrical curve into equal areas,
so that half the total number of cases are included within
the limits ± P.E. In Fig. 4, curve A, the area of the curve
between the ordinates ef and gh would be half the entire area
if the curve were a “normal” curve.

* Skewness = (Mean − Mode) / Standard Deviation.
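
The formula for σ, and the special meaning of the probable error in the normal case, may be verified with a short sketch. The grouped deviations and frequencies are hypothetical, and the function names are introduced here only for illustration. The area of the normal curve is obtained from the error function of the standard library.

```python
import math


def grouped_sigma(deviations, frequencies):
    """sigma = sqrt( S(f * x^2) / N ), x being deviations from the mean."""
    n = sum(frequencies)
    return math.sqrt(sum(f * x * x for x, f in zip(deviations, frequencies)) / n)


def normal_area_within(k):
    """Fraction of a normal curve lying within +/- k standard deviations."""
    return math.erf(k / math.sqrt(2))


# Hypothetical symmetrical grouping: deviations from the mean and their frequencies.
sigma = grouped_sigma([-2, -1, 0, 1, 2], [1, 4, 6, 4, 1])
probable_error = 0.67449 * sigma
print(sigma, probable_error, normal_area_within(0.67449))
```

The last figure printed is the proportion of cases falling within ± P.E. of the mean of a normal curve, which should be very nearly one half.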


In the case of a skew curve, these measures of “scatter” can be
taken about any one of the three “central tendency” measures.
They are also different for each half of the curve, although it 
is usually sufficient to lump the two values together. 

It is most important to distinguish carefully the measure 
of “central tendency” from the measure of “scatter.” Both 
should be evaluated in any piece of research. The second is 
a measure of the reliability of the first, or at least of the 
variability of the individual values of which the first is the 
average or representative value. 

To return to Riecker’s table. The limen corresponding to
50% two-point judgments may be calculated in two general
ways: (A) from the observed values, (B) by finding the best-fitting
smooth curve, adjusting the observations to this, and then
calculating the constants (mode, etc., σ, etc.) from the curve.


A. The required limen obviously falls somewhere between
1.5 lines (40% two-point judgments) and 2 lines (65%). To
determine its value roughly we may substitute in the following
formula, devised by Wundt in another connection:

$$D = \frac{D_a(50 - Z_b) + D_b(Z_a - 50)}{Z_a - Z_b},$$

where $D_a$ = distance giving $Z_a$% two-point judgments,
$D_b$ = distance giving $Z_b$% two-point judgments,
and $Z_a$ is the percentage just above 50 in the table,
$Z_b$ the percentage just below 50 in the table.

Substituting,

$$D = \frac{2(50 - 40) + 1.5(65 - 50)}{65 - 40} = 1.7 \text{ Paris lines}.$$
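
Wundt's formula is simply a linear interpolation between the two percentages that bracket 50, and may be written out as follows. The function and argument names are hypothetical.

```python
def interpolated_limen(d_below, z_below, d_above, z_above, target=50.0):
    """Distance at which the percentage would reach `target`, assuming the
    percentage rises in a straight line between the two tabulated points."""
    return (d_above * (target - z_below) + d_below * (z_above - target)) / (z_above - z_below)


# Riecker's bracketing values: 40% at 1.5 lines, 65% at 2 lines.
limen = interpolated_limen(d_below=1.5, z_below=40, d_above=2, z_above=65)
print(limen)
```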


This method is open to several objections : 


(1) It does not employ all the data; it uses two of the
percentages only.

(2) It assumes that between the values $D_a$ and $D_b$ the
frequency-curve runs parallel to the axis of abscissae.


(3) It gives no measure of scatter. 


A value which can be determined by a use of all the data
is that of the mean or average limen. The formula to be
employed is, of course, the well-known formula for the mean,

$$M = \frac{S(fx)}{N},$$

where x = abscissa of centre of base of any column,

f = area of corresponding column, i.e. the relative
frequency,

and N = total relative frequency.

Applied to our example, this gives

$$M = \frac{.75 \times 4 + 1.25 \times 26 + 1.75 \times 25 + 2.5 \times 15 + 3.5 \times 7 + 4.5 \times 9 + 5.5 \times 4}{4 + 26 + 25 + 15 + 7 + 9 + 4} = \frac{203.75}{90} = 2.26.$$
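
The same arithmetic may be reproduced from the table itself. The sketch below first converts the cumulative percentages into column frequencies by differencing, omitting the anomalous value for D = 0 as in the text, and then applies the formula for the mean. The function names are hypothetical.

```python
D = [0.5, 1, 1.5, 2, 3, 4, 5, 6]           # column boundaries, Paris lines
Z = [10, 14, 40, 65, 80, 87, 96, 100]      # cumulative % of two-point judgments


def column_frequencies(cumulative):
    """Relative frequency of limina falling between successive boundaries."""
    return [b - a for a, b in zip(cumulative, cumulative[1:])]


def grouped_mean(bounds, cumulative):
    freqs = column_frequencies(cumulative)
    mids = [(lo + hi) / 2 for lo, hi in zip(bounds, bounds[1:])]  # centres of the bases
    total = sum(freqs)
    return sum(f * x for f, x in zip(freqs, mids)) / total, total


mean_limen, total_frequency = grouped_mean(D, Z)
print(column_frequencies(Z), total_frequency, round(mean_limen, 2))
```

The total frequency comes to 90 and not 100 because the 10% of two-point judgments already given at 0.5 line are left out of the columns.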


It has been recently suggested by Dr Spearman* that the
most representative ordinate of a column (that about which the
limina of the column may be most accurately assumed to be
concentrated) is not the mid-ordinate, as is assumed in the
above formula, but an ordinate shifted slightly towards the
higher of the two neighbouring columns (see footnote†). He
therefore proposes to replace x of the preceding formula by a,
where

$$a_k = D_k + \frac{D_{k+1} - D_k}{3} \cdot \frac{f_{k-1} + 3f_k + 2f_{k+1}}{f_{k-1} + 2f_k + f_{k+1}}.$$


* The Method of “Right and Wrong Cases” (“Constant Stimuli”) without
Gauss’s Formulae, Brit. Journ. of Psychology, Vol. II. 1908, pp. 227–242.

† Figure given by Spearman, op. cit. p. 241 (slightly simplified).

The mid-ordinate $c_k$ is replaced by the ordinate $a_k$, which passes through
the centroid of the trapezium $D_k\,y_k\,y_{k+1}\,D_{k+1}$.

$y_k$ is the mid-point between the tops of the (k−1)th and kth columns, $= \tfrac{1}{2}(f_{k-1} + f_k)$,

$y_{k+1}$ is the mid-point between the tops of the kth and (k+1)th columns, $= \tfrac{1}{2}(f_k + f_{k+1})$,

$f_k$ = average relative frequency for kth column.



There is clearly some serious misconception here, since it
has long been known that the mean of any frequency-distribution
needs no correction*. A glance at the proof (see footnote)
reveals the source of the error. Spearman replaces each rect- 
angle by a trapezium of different area and then multiplies the 
abscissa of the centroid vertical of this new area by the old 
area, which area, however arranged, could not possibly have 
the centroid of the new one. The unadjusted value is the best 
value of the mean, but it is doubtful whether this is of much 
significance as a measure of the limen. 
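
The reduction on which Spearman's formula rests, and the point at which it goes astray, can be set out in a few lines. The working below is a sketch under two assumptions, both read from the footnote: that within the kth column the ordinate y runs in a straight line from $y_k$ to $y_{k+1}$, and that these end ordinates are the means of the adjacent column heights. The symbol w for the width of the column is introduced here for brevity.

```latex
% Assumptions: w = D_{k+1} - D_k;  y_k = (f_{k-1} + f_k)/2;  y_{k+1} = (f_k + f_{k+1})/2.
\begin{align*}
y(x) &= y_k + (y_{k+1} - y_k)\,\frac{x}{w}, \qquad 0 \le x \le w, \\
\int_0^w y\,dx &= \frac{w}{2}\,(y_k + y_{k+1})
   = \frac{w}{4}\,(f_{k-1} + 2f_k + f_{k+1}), \\
\int_0^w x\,y\,dx &= \frac{w^2}{6}\,(y_k + 2y_{k+1}), \\
\frac{\int_0^w x\,y\,dx}{\int_0^w y\,dx}
   &= \frac{w}{3}\cdot\frac{y_k + 2y_{k+1}}{y_k + y_{k+1}}
    = \frac{w}{3}\cdot\frac{f_{k-1} + 3f_k + 2f_{k+1}}{f_{k-1} + 2f_k + f_{k+1}} .
\end{align*}
```

The second line shows that the area of the trapezium equals the area $w f_k$ of the original rectangle only when $f_k$ is the mean of $f_{k-1}$ and $f_{k+1}$. In every other case the centroid so found belongs to a different area from the one by which it is afterwards multiplied.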


B. Assuming “normal” distribution (Fechner, Miiller). 


The classical description of this method is given in
G. E. Müller, Die Gesichtspunkte und die Tatsachen der psychophysischen
Methodik, pp. 45–50, 56–58, and the following
account aims at being merely a condensation of this description.
But first a few words of preliminary explanation. The form of
distribution of the individual limina is assumed to approximate
with a fair degree of closeness to the so-called Gaussian or
normal curve, or “curve of error,” the equation to which is
generally written in the form

$$y = \frac{h}{\sqrt{\pi}}\,e^{-h^2x^2}$$

(δ is sometimes put in place of x, to represent “error”: see later).

Fig. 5.

This gives at once the abscissa

$$a_k = D_k + \frac{\int_0^{D_{k+1}-D_k} xy\,dx}{\int_0^{D_{k+1}-D_k} y\,dx},$$

which reduces to the required formula,

$$a_k = D_k + \frac{D_{k+1} - D_k}{3} \cdot \frac{f_{k-1} + 3f_k + 2f_{k+1}}{f_{k-1} + 2f_k + f_{k+1}}.$$

* Cf., e.g., W. F. Sheppard: “On the Calculation of the most Probable
Values of Frequency Constants, etc.,” Proc. Lond. Math. Soc., Vol. XXIX. 358 ff.;
and also below, p. 38.


A more convenient form, however, is that employed by the
biometricians, viz.

$$y = \frac{N}{\sqrt{2\pi}\,\sigma}\,e^{-\frac{x^2}{2\sigma^2}},$$

in which σ, as usual, = the standard deviation
and N = total number of cases (area) of the distribution.

N may be taken as unity, in which case the correspondence of
the two forms gives us the relation

$$h = \frac{1}{\sigma\sqrt{2}}.$$

Thus h is an inverse measure of the “scatter” of the frequency-surface.
It is called the “measure of precision.”


Its relation to the mean variation M.V. (sometimes known
as the average deviation, A.D.) is given by the equation

$$\text{M.V.} = \frac{1}{h\sqrt{\pi}}.$$

Thus, again, in the case of the normal curve,

$$\sigma = \sqrt{\frac{\pi}{2}}\;\text{M.V.} = 1.2533\ \text{M.V.}$$

Finally,

$$\text{P.E.} = .67449\,\sigma = \frac{.4769}{h}.$$
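
These relations between h, σ, M.V. and P.E. can be checked numerically. The sketch below uses `NormalDist` from the Python standard library; the value σ = 2 is arbitrary, the function names are hypothetical, and the mean variation is confirmed independently by a crude summation of |x|·y over the curve.

```python
import math
from statistics import NormalDist


def normal_curve_constants(sigma):
    """Return (h, M.V., P.E.) for a normal curve of standard deviation sigma."""
    h = 1 / (sigma * math.sqrt(2))                   # measure of precision
    mean_variation = 1 / (h * math.sqrt(math.pi))    # M.V. = 1 / (h * sqrt(pi))
    # P.E. halves each side of the curve, i.e. it is the upper quartile deviation.
    probable_error = NormalDist(0, sigma).inv_cdf(0.75)
    return h, mean_variation, probable_error


def mean_variation_by_summation(sigma, steps=20000):
    """Independent check: midpoint sum of |x| * y(x) from -10*sigma to +10*sigma."""
    curve = NormalDist(0, sigma)
    dx = 20 * sigma / steps
    return sum(abs(x) * curve.pdf(x) * dx
               for x in (-10 * sigma + (i + 0.5) * dx for i in range(steps)))


sigma = 2.0  # arbitrary illustrative value
h, mv, pe = normal_curve_constants(sigma)
print(sigma / mv, pe / sigma, pe * h)
```

The three ratios printed should agree, to the number of figures quoted, with the constants 1.2533, .67449 and .4769 given above.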


Obviously the mode, mean, and median coincide. 


The ordinates corresponding to x = ±σ cut the curve in the
two points of inflexion (i.e. the points where the curve changes
from concavity to convexity, or vice versa).


The mathematical deduction of the equation of the pro- 
bability curve is based upon the general theory of independent 
probability. 


It is developed from the binomial expansion of $(p + q)^n$, viz.

$$p^n + np^{n-1}q + \frac{n(n-1)}{2!}\,p^{n-2}q^2 + \dots + npq^{n-1} + q^n,$$

the terms of which represent the distribution of chances of n
independent events, where the probability of occurrence of each
single event in one trial is p and the probability against its
occurrence is q (i.e. p + q = 1). The first term is the probability
that all will happen, the second that n − 1 will happen and
1 fail, the third that n − 2 will happen and 2 fail, and so on.
The single events must be independent of one another. Thus
if n teetotums are taken, each of which is marked in the ratio
of p to q, so that the chance that any teetotum will
fall on the marked piece is p, then the distribution
of chances when these n teetotums are spun N
times is given by the terms of the expansion,
$N(p + q)^n$. In the stock example of coin spinning,
$p = q = \tfrac{1}{2}$.

Fig. 6.
It can be proved that the Gauss curve is an extremely close 
approximation to the binomial frequency polygon 


(1) whatever be the value of n, if $p = q = \tfrac{1}{2}$,

(2) with inequality of p and q, provided that neither p nor
q is very small, and that n is very large.

It is not in agreement

(1) if p or q is very small,

(2) if there is correlation between the events or contributory
causes, i.e. if the events are not completely independent.


The standard deviation (σ) of the binomial frequency $= \sqrt{npq}$,
that of the Gaussian $= \sqrt{(n + 1)pq}$.
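
The closeness of the approximation in the favourable case, and its failure when p is very small, may be seen from a rough comparison of the binomial terms with the ordinates of a Gauss curve. In this sketch the Gauss curve is given the mean np and, as a simplifying assumption, the standard deviation √(npq) of the binomial itself. The values n = 20, p = .5 and p = .05 are arbitrary, and the function names are hypothetical.

```python
import math


def binomial_terms(n, p):
    """Terms of (p + q)^n, listed by the number k of events that happen (0..n)."""
    q = 1 - p
    return [math.comb(n, k) * p ** k * q ** (n - k) for k in range(n + 1)]


def gauss_ordinates(n, p):
    """Ordinates at k = 0..n of a normal curve with mean n*p and
    standard deviation sqrt(n*p*q) (an assumption of this sketch)."""
    q = 1 - p
    mean = n * p
    sigma = math.sqrt(n * p * q)
    return [math.exp(-(k - mean) ** 2 / (2 * sigma ** 2)) / (sigma * math.sqrt(2 * math.pi))
            for k in range(n + 1)]


def largest_gap(n, p):
    """Greatest absolute difference between a binomial term and the curve."""
    return max(abs(b - g) for b, g in zip(binomial_terms(n, p), gauss_ordinates(n, p)))


print(largest_gap(20, 0.5), largest_gap(20, 0.05))
```

The first gap should be found to be a small fraction of the second, in keeping with the conditions stated above.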


The above brief account must suffice for the theoretical
discussion of the Gauss curve*. Returning to Müller’s method,
we find it assumed that the successively increasing percentages
of two-point judgments correspond to the increasing area of the
portion of a Gaussian frequency-distribution† included between
a limiting ordinate OP (see accompanying figure), the axis of x,
the curve and the moving ordinate pD. Thus the area $Pp_1D_1O$
represents the percentage of two-point judgments given for an
objective distance between the compass points represented by
the abscissa $OD_1$, the area $Pp_3D_3O$ represents the percentage
for a distance $OD_3$. Our task is to determine the distance $OD_2$
which gives 50% two-point judgments, represented by half the
frequency-surface, viz. the area $Pp_2D_2O$. Let us call the average
limen (represented by $OD_2$ in the figure) S, and any particular
stimulus-value employed, D. Then, in a sufficiently large
number of experiments with D > S (e.g. = $OD_3$ in figure), so that
the percentage, Z, > 50, taking the whole area of the surface
as 1, we have

Z = area $Pp_3D_3O$ = area $Pp_2D_2O$ + area $p_2p_3D_3D_2$

$$= \frac{1}{2} + \frac{h}{\sqrt{\pi}} \int_0^{D-S} e^{-h^2\delta^2}\,d\delta,$$

or, putting hδ = t,

$$Z = \frac{1}{2} + \frac{1}{\sqrt{\pi}} \int_0^{(D-S)h} e^{-t^2}\,dt.$$

Similarly, with D < S (e.g. = $OD_1$ in figure), we have

Z = area $Pp_1D_1O$ = area $Pp_2D_2O$ − area $p_1p_2D_2D_1$

$$= \frac{1}{2} - \frac{h}{\sqrt{\pi}} \int_{-(S-D)}^{0} e^{-h^2\delta^2}\,d\delta
= \frac{1}{2} - \frac{1}{\sqrt{\pi}} \int_{-(S-D)h}^{0} e^{-t^2}\,dt.$$

Fig. 7.

* For a very clear elementary exposition, see Titchener, Experimental
Psychology, Vol. II. Student’s Manual, pp. 38–55, on “The Law of Error.” Also
Merriman, Method of Least Squares, pp. 6–25.

† As stated above, p. 20, footnote, there is, strictly speaking, no justification
for treating these percentages as parts of a “frequency-distribution.” This will
be made more clear later on.


Turning now to our experimental results, if we omit the
first and last series (D = 0 and D = 6) for reasons previously
given, viz. that 30% for D = 0 is an inversion of the first order,
and 100% for D = 6 shows that the limits of fluctuation of S have
been passed, and call the other D’s $D_1, D_2, \dots, D_7$, and the corresponding
percentages $Z_1, Z_2, \dots, Z_7$, then the values $t_1, t_2, \dots, t_7$
corresponding to the Z’s are given by Fechner’s Fundamental
Table (see Appendix I, Table I); and we have the equations:

$$
\begin{aligned}
t_1 &= (D_1 - S)h,\\
t_2 &= (D_2 - S)h,\\
&\;\;\vdots\\
t_7 &= (D_7 - S)h,
\end{aligned}
$$

which, if $t_1$, $t_2$, etc. were “observed” values, could be solved for
S and h by the Method of Least Squares.


But these t-values are not observed values; they are
theoretically deduced from the observed Z-values. Hence we
must, as it were, change them into observed values by “weighting”
them. They are to be weighted both by w′, a weight
proportional to the number of observations upon the basis of
which the corresponding Z’s were determined and also by w″
to change them into observed values. Thus $t_1$ is to be multiplied
by $w_1'w_1''$, $t_2$ by $w_2'w_2''$, and so on. Since, however, the
number of observations is almost universally equal in each
series of experiments (giving the different Z’s), w′ is in almost
all cases = 1, so that w″ is alone needed. This is given by
Müller’s Table of Coefficients of Weights* (see Appendix I, Table
II). We thus have the equations:

$$
\begin{aligned}
w_1t_1 &= (D_1 - S)h,\\
w_2t_2 &= (D_2 - S)h,\\
&\;\;\vdots\\
w_7t_7 &= (D_7 - S)h,
\end{aligned}
$$

which, solved by the Method of Least Squares, give the
“normal” equations:

$$
\begin{aligned}
\Sigma(D^2w)\cdot h - \Sigma(Dw)\cdot Sh &= \Sigma(Dtw),\\
-\Sigma(Dw)\cdot h + \Sigma(w)\cdot Sh &= -\Sigma(tw),
\end{aligned}
$$

where $\Sigma(D^2w) = (D_1^2w_1 + D_2^2w_2 + \dots + D_7^2w_7)$, and similarly with
the others.


Titchener has worked out Riecker’s results by this method, 
and gets 
h=OQ40, S=1S8: 
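
The course of the calculation may be followed in the sketch below, which solves the two normal equations for h and S from Riecker's seven retained percentages. It is offered under two assumptions that should be clearly understood. First, the t-values are computed from the inverse of the normal integral and not read from Fechner's Fundamental Table. Second, since Müller's Table of Coefficients of Weights is not reproduced in this section, the weight is taken as $e^{-2t^2}$, a stand-in chosen for illustration; the printed table, or Urban's, may be substituted through the `weight` argument. The results therefore need not agree exactly with Titchener's.

```python
import math
from statistics import NormalDist

D = [0.5, 1, 1.5, 2, 3, 4, 5]       # Paris lines (D = 0 and D = 6 omitted)
Z = [10, 14, 40, 65, 80, 87, 96]    # % two-point judgments


def t_value(percent):
    """t such that percent/100 = 1/2 + (1/sqrt(pi)) * integral_0^t exp(-u^2) du.
    This equals the ordinary normal deviate divided by sqrt(2)."""
    return NormalDist().inv_cdf(percent / 100) / math.sqrt(2)


def fit_limen(distances, percents, weight=lambda t: math.exp(-2 * t * t)):
    """Solve the normal equations for h and S.
    The default weight is an ASSUMED stand-in for the tabulated coefficients."""
    ts = [t_value(z) for z in percents]
    ws = [weight(t) for t in ts]
    sum_w = sum(ws)
    sum_dw = sum(d * w for d, w in zip(distances, ws))
    sum_d2w = sum(d * d * w for d, w in zip(distances, ws))
    sum_tw = sum(t * w for t, w in zip(ts, ws))
    sum_dtw = sum(d * t * w for d, t, w in zip(distances, ts, ws))
    h = (sum_w * sum_dtw - sum_dw * sum_tw) / (sum_w * sum_d2w - sum_dw ** 2)
    s_times_h = (h * sum_dw - sum_tw) / sum_w
    return h, s_times_h / h


h, S = fit_limen(D, Z)
print(round(h, 3), round(S, 3))
```

Passing `weight=lambda t: 1.0` gives the unweighted solution, which shows how far the weighting shifts the result towards the observations lying near the 50% point.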
In Fechner’s original discussion of this method, there was
no sufficiently clear distinction made between the quantities
S and h. It was assumed that S and $\frac{1}{h}$ measure much the
same quantity, i.e. that the product Sh is a constant. Müller
was the first to distinguish clearly between the two measures
in psycho-physics and to point out that Sh is by no means
always a constant quantity, although approximating to it in
certain cases. S and h do not always vary inversely with one
another, in a series of determinations of a particular threshold.
In some cases they show a tendency to vary directly.


The same method is applicable to the determination of
a difference-limen. In Müller’s form of the method the most
representative limen is taken to be that corresponding to 50%
“right answers” (and therefore also 50% wrong + uncertain or
“equal” answers). The formulae required are:

$$\frac{g}{100} = \frac{1}{2} + \frac{1}{\sqrt{\pi}} \int_0^{(\pm D - DL_u)h_u} e^{-t^2}\,dt,$$

$$\frac{l}{100} = \frac{1}{2} - \frac{1}{\sqrt{\pi}} \int_0^{(DL_l \pm D)h_l} e^{-t^2}\,dt,$$

where g = answers “greater,” l = less, u = equal or uncertain,
$DL_u$ = upper difference-limen, $DL_l$ = lower difference-limen,
D = any particular stimulus-difference (V − S) employed,
and is + or − according as the variable, V, is greater or less
than the standard, S.

$h_u$, $h_l$ are the measures of precision of the upper and lower
limen, respectively.

* A more correct table is that suggested by F. M. Urban, which is also
given in Appendix I.


The results (DL’s and h’s) are obtained, as before, by
means of Fechner’s Fundamental Table and Müller’s Table of
Coefficients of Weights*.


Where time-errors or space-errors† or both are possible,
the values of DL and h are determined separately for each
time or space order. In these cases the distributions often tend
to be “skew,” even when the results all lumped together give
an approximation to the Gaussian curve. Indeed Müller himself
emphasizes the unjustifiability of the assumption that
all distributions obtained by the constant method must be
normal or even must be symmetrical, and shows that for certain
of Merkel’s results‡ a “zweiteiliges Gausssches Gesetz” gives
a very good fit. This “bipartite” or compound Gaussian law
differs from the ordinary one simply in having two h’s or
measures of precision, one for the positive deviations of the
individual liminal values from their central value, and another,
a different one, for the negative deviations. For a brief
criticism of the law of distribution suggested by Bruns, and
known by his name, the reader is referred to Müller, op. cit.
pp. 90, 91. The whole question, though important, is too
difficult to come within the scope of this book.

* Better, Urban’s Table.
† See above, pp. 15, 16.
‡ G. E. Müller, op. cit. pp. 92, 93.


Correspondence of Results obtained by the Three 
Psycho-Physical Methods. 


It has already been pointed out that the average error or 
mean variation of the Method of Average Error corresponds 
closely to the mean variation obtained from the distribution 
of “equal” or “uncertain” answers in the Constant Method. 
The relation existing between determinations of liminal-values 
by the Limiting Method and Constant Method, respectively, 
is more complex, and may be discussed from two somewhat 
distinct points of view, viz. the mathematical and the psycho- 
logical. Assuming identical or approximately identical psycho- 
logical conditions in the employment of the two methods, the 
result (value of the limen) will be the same if the chance
values of the limen form a normal frequency-distribution,
different if this frequency-distribution is skew. In the latter
eventuality the limen given by the Limiting Method will, it is
clear, be greater or less than the limen given by the Constant
Method according as the frequency-distribution has its mode
towards the left (zero end) or towards the right. In the
ordinary forms of the two methods, however, the psychological
conditions must be very different. They can be approximated
to one another by getting the subject, in the Constant Method,
to reply “much larger,” “much smaller” as well as “larger,”
“smaller*.” In calculating the result only the last two series
of answers are employed.


It has generally been held hitherto that the Constant 
Method gives more reliable results than the Method of Limits, 
but F. M. Urbant has recently brought forward reasons for 


* Cf. Ebbinghaus: Grundztige der Psychologie, 1905, p. 89; but see Miiller, 
op. cit. p, 160. 

+ F. M. Urban: ‘‘On the Method of Just Perceptible Differences,” Psycho- 
logical Review, Vol. xiv. 1907, pp. 244—253. See, too, Vol. xviz. 1910, ‘‘ The 
Method of Constant Stimuli and its Generalizations,” pp. 229—259, by the 
same author, 
believing that, under certain conditions of experimentation, the
converse is the case. He employs the following formula for
the measure ($T$) of the limen:

$$T = \frac{1}{N}\,(r_1 P_1 N + r_2 P_2 N + \dots + r_n P_n N) = r_1 P_1 + r_2 P_2 + \dots + r_n P_n,$$

where $r_1 < r_2 < \dots < r_n$ are the successive values of the variable
stimulus ($V$) employed, and

$$P_1 = p_1, \quad P_2 = q_1 p_2, \quad P_3 = q_1 q_2 p_3, \quad \dots, \quad P_n = q_1 q_2 \cdots q_{n-1}\, p_n,$$

where, again, the $p$’s and $q$’s need to be explained; thus

$p_k$ is the probability that in the comparison of the stimulus $r_k$
with the standard the judgment “greater” will be given,

and

$q_k$ is the probability that a judgment will be given which is not
a “greater” judgment, and therefore $= 1 - p_k$.

$N$ = the number of series of experiments made.
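The passage from the $p$’s to the $P$’s, and thence to the limen, may be sketched as follows. This is only a minimal illustration in Python of the formula just given, not Urban’s own computation; the function name `urban_limen`, the stimulus values and the probabilities are all hypothetical, chosen merely to show the arithmetic.

```python
def urban_limen(stimuli, p_greater):
    """Return (P, T) for the formula T = r_1 P_1 + ... + r_n P_n.

    stimuli   -- r_1 < r_2 < ... < r_n, the values of the variable stimulus
    p_greater -- p_k, probability of a "greater" judgment on r_k
    """
    if len(stimuli) != len(p_greater):
        raise ValueError("one probability is needed for each stimulus")
    P = []
    none_so_far = 1.0  # running product q_1 q_2 ... q_(k-1)
    for p in p_greater:
        P.append(none_so_far * p)  # P_k = q_1 ... q_(k-1) p_k
        none_so_far *= 1.0 - p     # multiply in q_k = 1 - p_k
    T = sum(r * P_k for r, P_k in zip(stimuli, P))
    return P, T


# Hypothetical data for illustration only (not taken from Urban's tables).
stimuli = [84, 88, 92, 96, 100, 104, 108]
p_greater = [0.0, 0.1, 0.3, 0.5, 0.7, 0.9, 1.0]
P, T = urban_limen(stimuli, p_greater)
```

When the largest stimulus is always judged “greater” ($p_n = 1$), the $P$’s sum to unity, so that $T$ is a weighted mean of the stimulus values.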


This modified form of the method evidently approximates
closely to the Constant Method. The $p$’s are the percentage-values
obtained in the latter. The $P$’s are more reliable,
though smaller, than the $p$’s. $P_k$ is a measure of the component
probability that $r_k$ is judged “greater” and that on all the
smaller $r$’s ($r_1 \dots r_{k-1}$) judgments are given which are not “greater”
judgments. Since the value of the $P$’s is not changed by the
order in which the $r$’s are presented, these latter may be
presented in any order, thus eliminating the influence of
expectation, and bringing the method very close indeed to the
Constant Method. Another important point is “to vary the
steps ‘by which one approaches the threshold,’ because otherwise
one can not make the supposition of a symmetrical
distribution” (p. 247). Urban compared the results obtained
by means of his formula, using lifted weights, with the results
obtained empirically by (as it were) directly observing $P_k$, or
something proportional to $P_k$, viz. $N_k$, “the number of times
it occurred that each weight [in this case, $r_k$] was the lightest
weight of the entire series to be judged ‘heavier’” (p. 249).
The agreement of observed results and theoretical results was 
found to be very close in all the cases. The article concludes 
as follows: “The theoretical basis of the method of just 
perceptible differences is the same as that of the error method, 
namely empirical determinations of the probabilities of judgments 
of different types on given differences of intensity. The result 
of the so-called method of just perceptible differences is that 
amount of difference for which there exists the probability one 
half that it will be recognised”* (p. 258).


One aspect of the psycho-physical methods has been scarcely 
mentioned in all the preceding description, viz. the attitude 
of the subject. Its importance is hardly likely to be overlooked 
in the present state of opinion in psychology. It is superfluous
to emphasize the need of careful introspective reports (given 
either at the end of the experimental series or after each 
observation) in all quantitative determinations, liminal and 
supraliminal alike. 


The degree of “fore-knowledge” of the subject, the form
in which he is asked to give his judgment—in terms of standard 
or variable, of right-hand or left-hand stimulus, etc.—and 
other recognisable psychological factors directly bearing upon 
the quantitative determination, should be varied in as many 
ways as possible and the corresponding results noted ; the effects 
of practice, fatigue, “contrast,” “side-comparisons,” “absolute 
impression,” etc. should be reduced to a minimum or else 
rendered constant and, where possible, measured ; in short, the 
quantitative determinations should always be made for the sake 
of the psychological processes involved, in the fullest sense of the 
phrase, and not conversely. The problems of the psychological 
processes of discrimination, judgment, etc. thus raised are of the 
first importance, but their discussion would take us beyond 


* Cf. Külpe: the probable value of the limen is “that stimulus difference
which is just as often cognized (correctly judged) as not cognized (incorrectly
judged)” (quoted from Sanford: Experimental Psychology, footnote, p. 351).

the scope of this book. We can only just mention them here,
emphasize their importance, and indicate the relation in which 
they stand to mental measurement in the narrowest sense. 


Frequency-Distributions. 


We have seen above that the graphical representation of 
the percentages of any one of the three kinds of judgments 
obtained in the Constant Method, although sometimes approxi- 
mating to the form of a Gaussian frequency-distribution, is in 
many cases decidedly “skew.” In these latter cases the most 
representative limen would be given by the mode of the smooth 
curve which best fits the figure. Unfortunately the determina- 
tion of the equation of such a best-fitting curve must be 
almost entirely empirical in nature, since we have so slight 
a theoretical basis on which to build. Urban’s notion of the 
probability of a judgment, illustrated in the preceding section, 
seems to be the most promising as a starting-point, and there 
is little doubt that his publications* are the most important 
for the modern student of psycho-physics. Much still remains 
to be done, however, and great caution is needed in the theo- 
retical estimation of results. We have already suggested that 
the identification, or even the approximation of graphical 
representations of “percentage” judgments to true frequency-distributions
is of extremely doubtful validity. This will
become clearer on a consideration of the nature of the latter. 
True frequency-curves represent the distribution of “random 
samples” taken from a “total population,” and therefore have 
a definite mathematical basis in the theory of probability. 
They correspond to the results obtained in drawing balls from 
a bag, or in teetotum-spinning, dice-throwing, etc. Cases of 
this kind obviously occur in great numbers in psychology,— 
e.g. the distribution of a particular kind of mental ability in 


* In addition to articles already quoted, see “Die psychophysischen Massmethoden
als Grundlagen empirischer Messungen,” Archiv f. d. ges. Psychologie,
Vols. xv. and xvi.; also, The Application of Statistical Methods to the Problems
of Psychophysics, Philadelphia, 1908.

a group of individuals,—and hence, though not strictly relevant 
to the subject of the present chapter, they are relevant to the 
general problem of mental measurement, and are most con- 
veniently discussed at this point. 


The general theory of curve-fitting has been worked out 
in great detail by Professor Karl Pearson*. A good account 
of his method is given in a book by W. Palin Elderton, 
Frequency-Curves and Correlation, C. and E. Layton, London,
1906, to which the mathematical reader is referred for fairly 
complete theoretical and practical information. The following 
short sketch of the outline of Pearson’s system will perhaps serve 
as a reminder for readers already acquainted with the methods, 
and in any case will be of some little use in indicating the 
general significance of biometric notation which will be em- 
ployed later on in this book. 


Frequency-curves of data not involving a mixture of species 
tend to commence at zero, rise to a maximum and then fall 
either at the same or at a different rate. There is often high 
contact at one or both ends of the distribution. An equation 
of the general form 


$$\frac{1}{y}\frac{dy}{dx} = \frac{x + a}{f(x)}$$

satisfies these conditions, since if $y = 0$, $\dfrac{dy}{dx} = 0$ (high contact),
and if $x = -a$, $\dfrac{dy}{dx} = 0$ (maximum; for a maximum, again, the
second differential coefficient must be negative). Expanding
$f(x)$ by Maclaurin’s theorem, we have

$$\frac{1}{y}\frac{dy}{dx} = \frac{x + a}{c_0 + c_1 x + c_2 x^2 + c_3 x^3 + \dots}.$$


* Karl Pearson, “Skew Variation in Homogeneous Material,” Phil.
Trans. Vol. 186 A, pp. 343 ff.; “On the Systematic Fitting of Curves to Observations
and Measurements,” Biometrika, Vol. i. pp. 265 ff. and Vol. ii. pp. 1 ff.,
1901–1903; “On the Curves which are most suitable for describing the
frequency of Random Samples of a Population,” Biometrika, Vol. v. 1906,
pp. 172–175 (an exceedingly clear summary of the principles involved).

(1) Putting $c_1 = c_2 = c_3 = \dots = 0$, we have

$$\frac{1}{y}\frac{dy}{dx} = \frac{x + a}{c_0},$$

which is the Gaussian or normal curve. It fits the symmetrical
binomial, $(\tfrac{1}{2} + \tfrac{1}{2})^n$, e.g. in coin-tossing, where the chances for
and against are equal ($p = q$), and the contributory causes are
independent of one another.

(2) Putting $c_2 = c_3 = \dots = 0$, we have

$$\frac{1}{y}\frac{dy}{dx} = \frac{x + a}{c_0 + c_1 x},$$

which represents a class of curves varying from the Gaussian
curve to the J-curve. It fits the asymmetrical or “point”
binomial, $(p + q)^n$, e.g. in teetotum-spinning or dice-throwing,
where the chances for and against are not equal, $p \neq q$, but the
contributory causes are independent of one another.

(3) Putting $c_3 = c_4 = \dots = 0$, we have

$$\frac{1}{y}\frac{dy}{dx} = \frac{x + a}{c_0 + c_1 x + c_2 x^2},$$

which can be made to represent almost all the frequency
distributions which may arise. It fits the “hypergeometrical
series,” the successive terms of which give, e.g., the chances of
getting $r, r - 1, \dots, 0$ black balls from a bag containing $pn$ black
and $qn$ white balls when $r$ balls are drawn*.

Here the contributory causes are not independent of one
another.

There is no advantage in employing equations which involve
$c_3$ and higher constants, because their use involves the calculation
of the 6th and higher “moments,” and these have very high
probable errors.


* The series is:

$$\frac{pn(pn-1)\cdots(pn-r+1)}{n(n-1)\cdots(n-r+1)} \left\{ 1 + r\,\frac{qn}{pn-r+1} + \frac{r(r-1)}{1\cdot 2}\,\frac{qn(qn-1)}{(pn-r+1)(pn-r+2)} + \dots \right\}.$$

Other series may arise.

Definition. The $n$th moment coefficient ($\mu'_n$) of any distribution
about any ordinate is the sum of the products of the
partial-frequencies and the $n$th power of the distances of these
frequencies from the ordinate, divided by the total frequency.

[Figure: frequency-curve with the centroid vertical marked.]

In symbols, if $N$ be the total frequency,

$$N \mu'_n = \int x^n y \, dx.$$

The moments are, in practice, first calculated about any
arbitrary ordinate that is most convenient, and then reduced to
moments about the centroid in the following way*:

$$N \mu_n = \int (x - \mu'_1)^n \, y \, dx = N \mu'_n - n \mu'_1 N \mu'_{n-1} + \frac{n(n-1)}{1\cdot 2}\, \mu_1'^2 N \mu'_{n-2} - \dots$$

This gives the general reduction formula

$$\mu_n = \mu'_n - n \mu'_1 \mu'_{n-1} + \frac{n(n-1)}{1\cdot 2}\, \mu_1'^2 \mu'_{n-2} - \frac{n(n-1)(n-2)}{1\cdot 2\cdot 3}\, \mu_1'^3 \mu'_{n-3} + \dots,$$

which enables us to transfer any moment from an arbitrary
ordinate to the mean. Thus we have:

$$\mu_2 = \mu'_2 - \mu_1'^2,$$
$$\mu_3 = \mu'_3 - 3\mu'_1 \mu'_2 + 2\mu_1'^3,$$
$$\mu_4 = \mu'_4 - 4\mu'_1 \mu'_3 + 6\mu_1'^2 \mu'_2 - 3\mu_1'^4.$$

$\mu$’s represent moments of the curve; but we have to start with
grouped frequencies, where the frequencies are assumed to be
concentrated along the mid-ordinates of the rectangles (cf.
Fig. 8, B). The moments obtained from these grouped frequencies
are denoted by $\nu$’s (dashed and undashed), and corrections
are necessary. These have been deduced by Sheppard†
and are consequently known as Sheppard’s adjustments.


* Dashed $\mu$’s represent moments about an arbitrary ordinate,
undashed $\mu$’s represent moments about the centroid vertical.
† W. F. Sheppard: “On the Calculation of the most Probable Values of
Frequency Constants, for Data arranged according to Equidistant Divisions of
They are:

$$\mu_2 = \nu_2 - \tfrac{1}{12},$$
$$\mu_3 = \nu_3,$$
$$\mu_4 = \nu_4 - \tfrac{1}{2}\nu_2 + \tfrac{7}{240}.$$

It is generally said that they are only valid when there is
“high contact” at the ends of the frequencies, but the equations
for $\mu_2$ and $\mu_3$ are probably still valid, even without high contact,
if the terminal frequencies are zero.

[N.B. In working, $\nu'$’s are changed into $\nu$’s before applying
Sheppard’s corrections.]

It is obvious that $\nu_0 = 1$ and $\nu_1 = 0$. $\nu'_1$ is the distance of
the mean or centroid vertical from the arbitrary ordinate about
which the moments are first taken, and is conveniently known
as $d$. $\sigma = \sqrt{\mu_2}$.
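Two very important constants in curve-fitting are

$$\beta_1 = \frac{\mu_3^2}{\mu_2^3} \quad \text{and} \quad \beta_2 = \frac{\mu_4}{\mu_2^2}.$$

The values of these are always to be calculated, and, within
the limits of their probable errors (for which see Rhind’s Tables,
Biometrika, Vol. vii. 1909), they fix the type to which the
curve belongs. The general frequency-curve equation, written
in terms of moments, is

$$\frac{1}{y}\frac{dy}{dx} = -\,\frac{x + \dfrac{\sigma\sqrt{\beta_1}\,(\beta_2 + 3)}{2(5\beta_2 - 6\beta_1 - 9)}}{\dfrac{\sigma^2 (4\beta_2 - 3\beta_1)}{2(5\beta_2 - 6\beta_1 - 9)} + \dfrac{\sigma\sqrt{\beta_1}\,(\beta_2 + 3)}{2(5\beta_2 - 6\beta_1 - 9)}\, x + \dfrac{2\beta_2 - 3\beta_1 - 6}{2(5\beta_2 - 6\beta_1 - 9)}\, x^2}.$$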


This gives at once, for the distance between the mean and
mode,

$$\text{mean} - \text{mode} = \frac{\sigma\sqrt{\beta_1}\,(\beta_2 + 3)}{2(5\beta_2 - 6\beta_1 - 9)}. \qquad \text{(Origin is at mean.)}$$

Hence the curve is symmetrical if $\beta_1 = 0$. If $\beta_1 = 0$, and
$\beta_2 = 3$, the curve reduces to the Gaussian, since the terms involving
$x$ in the denominator of the right-hand side of the
general frequency-curve equation vanish.

In using the Pearsonian method, then, the order of procedure
to be adopted is:


a Scale,” Proc. Lond. Math. Soc. Vol. xxix. pp. 353 ff. Karl Pearson: “On an
Elementary Proof of Sheppard’s Formulae for correcting Raw Moments and on
other allied Points,” Biometrika, Vol. iii. 1904, pp. 308 ff.

(1) calculate the moment-coefficients $\nu'_1$, $\nu'_2$, $\nu'_3$, $\nu'_4$ about
a convenient arbitrary ordinate,

(2) transfer to the mean by the equations

$$\nu_2 = \nu'_2 - \nu_1'^2,$$
$$\nu_3 = \nu'_3 - 3\nu'_1 \nu'_2 + 2\nu_1'^3,$$
$$\nu_4 = \nu'_4 - 4\nu'_1 \nu'_3 + 6\nu_1'^2 \nu'_2 - 3\nu_1'^4$$

($\nu'_1$, or $d$, is the distance of the mean from the arbitrary
ordinate),

(3) determine the corresponding moments for the curve
by the equations (Sheppard’s corrections)

$$\mu_2 = \nu_2 - \tfrac{1}{12},$$
$$\mu_3 = \nu_3,$$
$$\mu_4 = \nu_4 - \tfrac{1}{2}\nu_2 + \tfrac{7}{240}.$$

[N.B. For these corrections to be applicable, two conditions
must be fulfilled:

(i) there must be high contact,

(ii) the grouping of the frequencies must be equal.]

(4) calculate $\beta_1$ and $\beta_2$ by the equations

$$\beta_1 = \frac{\mu_3^2}{\mu_2^3}, \qquad \beta_2 = \frac{\mu_4}{\mu_2^2}.$$

These results give the distance of the mean ($\nu'_1$ or $d$) from
the arbitrary ordinate, the standard deviation ($\sqrt{\mu_2}$), and the
mode,

$$\text{mean} - \text{mode} = \frac{\sigma\sqrt{\beta_1}\,(\beta_2 + 3)}{2(5\beta_2 - 6\beta_1 - 9)}.$$

The investigator may then proceed to determine to which
of the seven Pearsonian “types” the particular curve belongs,
to find its equation, and to plot it. For instructions on these
points the reader is referred to Palin Elderton’s book already
mentioned.

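The four steps of the procedure may be sketched in Python as follows. This is an illustration under stated assumptions rather than a definitive routine: the function name `pearson_constants` and the frequencies are hypothetical, the classes are taken to be of equal width with that width as the unit, and $\sqrt{\beta_1}$ is given the sign of $\mu_3$ so that the sign of “mean − mode” follows the direction of skewness.

```python
import math


def pearson_constants(frequencies, origin_index=0):
    """Steps (1)-(4) for frequencies grouped in equal classes of unit width.

    frequencies  -- observed frequency in each successive class
    origin_index -- index of the class whose mid-ordinate serves as the
                    arbitrary ordinate (an assumption of this sketch)
    """
    total = sum(frequencies)

    def raw_moment(n):
        # (1) nu'_n: moment-coefficient about the arbitrary ordinate
        return sum(f * (i - origin_index) ** n
                   for i, f in enumerate(frequencies)) / total

    d, n2, n3, n4 = (raw_moment(n) for n in (1, 2, 3, 4))

    # (2) transfer to the mean
    nu2 = n2 - d ** 2
    nu3 = n3 - 3 * d * n2 + 2 * d ** 3
    nu4 = n4 - 4 * d * n3 + 6 * d ** 2 * n2 - 3 * d ** 4

    # (3) Sheppard's corrections (unit of grouping = 1)
    mu2 = nu2 - 1 / 12
    mu3 = nu3
    mu4 = nu4 - nu2 / 2 + 7 / 240

    # (4) beta_1 and beta_2
    beta1 = mu3 ** 2 / mu2 ** 3
    beta2 = mu4 / mu2 ** 2
    sigma = math.sqrt(mu2)

    # mean - mode; attaching the sign of mu_3 to sqrt(beta_1) is an assumption
    root_beta1 = math.copysign(math.sqrt(beta1), mu3)
    mean_minus_mode = (sigma * root_beta1 * (beta2 + 3)
                       / (2 * (5 * beta2 - 6 * beta1 - 9)))

    return {"d": d, "mu2": mu2, "mu3": mu3, "mu4": mu4,
            "beta1": beta1, "beta2": beta2, "sigma": sigma,
            "mean_minus_mode": mean_minus_mode}


# Hypothetical grouped frequencies, for illustration only.
symmetric = pearson_constants([1, 4, 6, 4, 1])
skew = pearson_constants([2, 10, 25, 30, 18, 9, 4, 2], origin_index=3)
```

For the symmetrical set of frequencies $\beta_1$ vanishes and the mean coincides with the mode; for the skew set the two are separated.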
CORRELATION


INTRODUCTION 


A somewhat detailed account of the mathematical theory 
of correlation and of the way in which it may be usefully 
applied to psychological measurements will be found in the 
later chapters of this Part. The object of the following 
introductory pages is to give the reader a general preliminary 
view of the method, free from mathematical complications, and 
to illustrate it by means of a simple example. 


Correlation may be briefly defined as “tendency towards 
concomitant variation,” and a so-called correlation coefficient 
(or, again, correlation ratio) is simply a measure of such 
tendency, more or less adequate according to the circumstances 
of the case. J. S. Mill, in his “System of Logic,” distinguished
a special scientific “Method of Concomitant Variations,” which
he based upon the following principle : 

“Whatever phenomenon varies in any manner whenever 
another phenomenon varies in some particular manner, is 
either a cause or an effect of that phenomenon, or is connected 
with it through some fact of causation*.” 


The instances of this principle which Mill had in mind were 
mainly cases of approximately “complete” concomitance of 
variation, such as those usually met with in the domain of 
Physics. In such cases, the conditions of an experiment admit 
of a high degree of simplification, the phenomenon, or series of 


* Logic, Bk. iii. Ch. viii. § 6.

phenomena, under investigation can be isolated with tolerably
complete success, the “irrelevant” factors can be reduced to a 
minimum. Under such conditions, when the degree of con- 
comitance of the different corresponding measures of the two 
phenomena is found to be very high, the slight deviations 
from complete correspondence are put down to “errors of 
observation” or other unavoidable imperfections in the experi- 
mental method employed. 


If the correspondence is one of simple proportionality, so
that the graphical representation of it (one phenomenon being
measured along the axis of $x$, the other along the axis of $y$)
is a straight line*, the correlation coefficient, $r$, will be $= 1$.
Example: the variation of the length of a metal rod with
temperature.

If the correspondence, although still approximately complete,
is not one of simple proportionality, the graphical representation
of it will be, not a straight line, but a curve of greater or less
degree of complexity†, and the correlation, also complete, will
be measured not by the correlation coefficient, $r$, but by the
correlation ratio, $\eta$. $\eta$ in this case will be $= 1$. Example: the
variation (inverse) of the volume of a certain quantity of gas
with the pressure to which it is subjected, the temperature
remaining constant. A number of pairs of values, $P_1$, $V_1$;
$P_2$, $V_2$; $P_3$, $V_3$; etc. are obtained, and when plotted are found
to give a “scatter diagram” of the approximate form of


Figure 9. 


In this figure, the dots represent the individual pairs of 
observations, P, V. They cluster very closely about the 
hyperbola, PV =k, represented by the broken curve. The 
curve is assumed to represent the “real” or “true” relation of 
the two “variates” (as we call such quantities as P, V), and 
the slight deviations of the observed values from this curve are 
explained as due to errors of observation and to other factors 
irrelevant to the relation under investigation. However this 
may be, the interesting point about the figure so far as our 


* Hence the correlation is said to be “linear.”
† Correlation said to be “non-linear” or “skew.”

present purpose of explaining correlation is concerned is that
any definite observed $P$-value is “correlated” with a plurality
or “array” of observed $V$-values, and that, similarly, any
definite observed $V$-value is correlated with a plurality of
$P$-values. These arrays of observed values cluster extremely
closely about their means (situated on the curve), i.e. their
“scatter” or “variability,” as measured by their standard
deviations ($\sigma$), is extremely small.

[Fig. 9. Scatter diagram of the observed pairs $P$, $V$, clustering about the hyperbola $PV = k$.]


The modern theory of correlation is directed towards the 
manipulation of observations made upon phenomena of a much 
greater degree of variability than that found in the case of 
isolated physical phenomena. The increased variability is no
doubt due, in the main, to the complexity of factors involved.
The elementary factors do not admit of isolation, and with
reference to the concomitance of variation of the two series of
phenomena under consideration they, as it were, pull in different
directions. The correlation coefficient and correlation ratio
measure, in these cases, the average extent of the concomitance.
As will be explained more fully in the next chapter, $r$ can
only be taken as a measure of correlation when the average
relation between the two variates is linear, and in this case its
value is identical with that of $\eta$. When the relation is non-linear
$r$ is practically meaningless, but $\eta$ still measures the
relation accurately.


The general problem will become clearer by reference to 
the accompanying figure. 


[Fig. 10. Scatter diagram: mechanical memory ($x$) on the horizontal axis, general intelligence ($y$) on the vertical axis.]


Let us assume that we have a group of 200 school-children
and have measured each of them for mechanical memory ($x$)
and for general intelligence ($y$). Each of the dots in the figure
represents a child. Then if we determine the mean $y$-values
corresponding to each successive “group” of $x$-values, e.g. $x_1$ to
$x_2$, by assuming the observations concentrated on the mid-ordinate
$PM$*, the line $AB$† drawn through these mean
$y$-values (marked by crosses in heavy type) represents the law
of change of mean $y$-value with increase of $x$ and gives the

* The true centroid ordinate is slightly nearer the denser part of the scatter-diagram,
here slightly towards the right of $PM$. The correction is made later
by means of Sheppard’s formulae (see above p. 39, and Appendix II, p. 137).

† I have placed the means on the straight line for the sake of convenience
of exposition. Actually, they will occur irregularly on either side of it, and
$AB$ will be the “best-fitting” straight line, determined by an application of the
Method of Least Squares. See next chapter and also Fig. 16, Appendix II, p. 140.

“most probable” value of $y$ for any particular value of $x$. If
the line is straight or approximately straight the “regression”
is said to be linear, and the equation to the line is

$$y - \bar{y} = r\,\frac{\sigma_2}{\sigma_1}\,(x - \bar{x}),$$

where $\bar{x}$, $\bar{y}$ are the mean values of all the $x$’s and $y$’s respectively
(not the means of the arrays just mentioned),
$\sigma_1$, $\sigma_2$ are the standard deviations of the $x$’s and $y$’s respectively,
and $r$ is the coefficient of correlation.

Thus, if
20 is the mean value of the mechanical memory of the group,
30 is the mean value of the general intelligence of the group,
4 is the standard deviation for mechanical memory ($\sigma_1$),
7 is the standard deviation for general intelligence ($\sigma_2$),
and, finally, 0.6 is the value of $r$; then the most probable
measure of the general intelligence of a child whose mechanical
memory is represented by, say, the value 14, is given by the
equation

$$y - 30 = 0.6 \times \tfrac{7}{4}\,(14 - 20),$$

whence $y = 23.7$.

This value is the average of an array of possible values, whose
standard deviation

$$= \sigma_2 \sqrt{1 - r^2} = 5.6.$$
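The arithmetic of this example may be checked by a short sketch in Python. The function names `predict_y` and `array_sd` are hypothetical, introduced only for illustration; the figures are those of the example (means 20 and 30, standard deviations 4 and 7, $r = 0.6$).

```python
import math


def predict_y(x, mean_x, mean_y, sd_x, sd_y, r):
    """Most probable y for a given x, from the linear regression of y on x."""
    return mean_y + r * (sd_y / sd_x) * (x - mean_x)


def array_sd(sd_y, r):
    """Standard deviation of the array of y-values about its mean."""
    return sd_y * math.sqrt(1 - r ** 2)


y_hat = predict_y(14, mean_x=20, mean_y=30, sd_x=4, sd_y=7, r=0.6)
spread = array_sd(sd_y=7, r=0.6)
```

Here `y_hat` reproduces the value 23.7 and `spread` the value 5.6 obtained above.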

It will be proved in the next chapter that

$$r = \frac{S(xy)}{N \sigma_1 \sigma_2}\,{}^{*},$$

where $x$ and $y$ are deviations from the mean (not absolute values
as assumed above), $S(\ )$ indicates summation, i.e.

$$S(xy) = x_1 y_1 + x_2 y_2 + \dots + x_n y_n,$$

and $N$ = the total number of cases (children measured).


* First suggested by Bravais; shown to be the best measure by Karl Pearson,
who gave it the name of the “product-moment formula.”

A. Bravais: “Analyse mathématique sur les probabilités des erreurs de
situation d’un point,” Acad. des Sciences: Mémoires présentés par divers
savants, IIe Série, ix. 1846, p. 255.

Karl Pearson: “Regression, Heredity, and Panmixia,” Phil. Trans. Roy.
Soc., Series A, Vol. clxxxvii. 1896, pp. 263 ff.

It is important to note that by starting from $y$ instead of
from $x$, and determining the means of the $y$-arrays (such as the
array within the limits $y_1 y'_1$, $y_2 y'_2$), another regression line, $CD$,
is obtained different from the first. Its equation is


$$x - \bar{x} = r\,\frac{\sigma_1}{\sigma_2}\,(y - \bar{y}),$$

and it represents the law of change of mean $x$-value with
increase of $y$. It gives the “most probable” value of $x$ for any
particular value of $y$.


If the series of means do not lie on a straight line (approx.)
but on a curve of greater or less complexity, the above calculation
is meaningless. In such a case, called a case of skew
correlation and non-linear regression, the only measure of the
correlation of the two variates is that given by $\eta$, the correlation
ratio. $\eta$ is the ratio of the standard deviation of the means of
the arrays ($\Sigma$) to the total standard deviation (of either the $x$’s
or the $y$’s). Thus there are two values of $\eta$, one for the $x$’s, and
another for the $y$’s. They approximate closely to one another,
as a rule, so that only one need be calculated.

$$\eta = \frac{\Sigma}{\sigma_1}, \ \text{or} \ \frac{\Sigma}{\sigma_2}.$$

When the regression is linear, $\eta = r$; otherwise $\eta > r$.
$r$ ranges between the values $\pm 1$, $\eta$ between 0 and 1. $\eta$ is
always positive.
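The contrast between $r$ and $\eta$ may be made concrete by a small sketch in Python. The function names and the data are hypothetical; the arrays are formed, as an assumption of the sketch, by collecting all $y$-values that share the same $x$-value, and the standard deviation of the array means is weighted by the number of cases in each array.

```python
import math
from collections import defaultdict


def product_moment_r(xs, ys):
    """r = S(xy) / (N * sigma_1 * sigma_2), with x, y taken as deviations."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs) / n)
    sd_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys) / n)
    return s_xy / (n * sd_x * sd_y)


def correlation_ratio(xs, ys):
    """eta of y on x: S.D. of the array means over the total S.D. of y."""
    n = len(ys)
    mean_y = sum(ys) / n
    arrays = defaultdict(list)
    for x, y in zip(xs, ys):
        arrays[x].append(y)  # one array of y-values for each x-value
    between = sum(len(a) * (sum(a) / len(a) - mean_y) ** 2
                  for a in arrays.values()) / n
    total = sum((y - mean_y) ** 2 for y in ys) / n
    return math.sqrt(between / total)


# Hypothetical data: a perfectly regular but non-linear relation, y = x^2.
xs = [-2, -1, 0, 1, 2] * 2
ys = [x * x for x in xs]
r_curved = product_moment_r(xs, ys)
eta_curved = correlation_ratio(xs, ys)

# Hypothetical data: a perfectly linear relation, y = 2x + 1.
ys_linear = [2 * x + 1 for x in xs]
r_linear = product_moment_r(xs, ys_linear)
eta_linear = correlation_ratio(xs, ys_linear)
```

In the curved case $r$ vanishes although the relation is complete, while $\eta$ is unity; in the linear case the two measures coincide.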

It will now have become clear that the correlation ratio, $\eta$
(always) and the correlation coefficient, $r$ (when regression is
linear) are measures of the tendency towards concomitant
variation exhibited by two series of phenomena, and hence 
throw some light upon the causal relations of these phenomena. 
Exactly what kind of causal relation we are justified in inferring 
from them will become clearer in the course of the next few 
chapters. 


We may illustrate the significance of the idea of correlation 
in a slightly different (and more elementary) way. Let us 
suppose that the 200 children have been arranged in order of 
merit, as regards mechanical memory, on the one hand, and as 
regards general intelligence on the other. If now it were found
that each child’s order was the same in both, i.e. that the child
first in mechanical memory was first in general intelligence, the
child second in mechanical memory was second in general
intelligence, and so on, the correspondence between the two
series would be complete and $r$ would be $+1$. Or if, on a
second supposition, the child first in the one was last in the
other, second in the one was next to last in the other, and so
on, the correspondence between the two series would again be
complete, but inverse, and $r$ would be $-1$. Finally, if there is
no correspondence whatever between the two series, $r$ will be 0.
A value of $r$ between 0 and $+1$ will express a tendency, greater
or less according to $r$’s size, for children above the average or
mean position in the one ability to be above the mean position
in the other, and for children below the mean position in the
one to be below the mean position in the other. A value of $r$
between 0 and $-1$ will express a tendency, greater or less
according as $r$ is numerically greater or less, for the children
above the mean position in the one ability to be below the mean
position in the other, and conversely. Now if order or “rank”
be taken as an (inverse) measure of ability, the value of

$$\frac{S(xy)}{N \sigma_1 \sigma_2}, \ \text{or} \ r, \ \text{becomes} \quad 1 - \frac{6\,S(d^2)}{N(N^2 - 1)},$$

where $S(\ )$ is the symbol for summation, and $d$ is the
difference between the rank of an individual in the one series
and his rank in the other. This form gives us a general
impression of its appropriateness for the purpose in view, since
the greater the disparity between the two series of ranks the
greater is $S(d^2)$ and hence the smaller is $r$. If there is no
relation at all between the two series, $S(d^2)$ acquires the value
it would have according to pure chance, and this can be shown
to be $= \tfrac{1}{6} N(N^2 - 1)$, which makes the whole expression 0, as it
should do.
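The behaviour of the rank formula at its three landmarks may be verified by a minimal sketch in Python. The function name `rank_correlation` and the sets of ranks are hypothetical; the third set is chosen so that $S(d^2)$ takes exactly the chance value $\tfrac{1}{6}N(N^2 - 1)$.

```python
def rank_correlation(ranks_a, ranks_b):
    """1 - 6 S(d^2) / (N (N^2 - 1)), d being the difference of ranks."""
    n = len(ranks_a)
    s_d2 = sum((a - b) ** 2 for a, b in zip(ranks_a, ranks_b))
    return 1 - 6 * s_d2 / (n * (n ** 2 - 1))


order = [1, 2, 3, 4, 5]
same = rank_correlation(order, [1, 2, 3, 4, 5])      # identical orders
inverse = rank_correlation(order, [5, 4, 3, 2, 1])   # completely inverted
chance = rank_correlation(order, [4, 1, 3, 5, 2])    # S(d^2) = 20 = N(N^2-1)/6
```

The three results are $+1$, $-1$ and 0 respectively.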

The one objection to the formula is that it assumes the
difference between any two neighbouring ranks to be equal at
all parts of the scale. This is obviously a false assumption;
the distance of individual from individual at the two extreme
ends of the scale must be considerably greater than that
between individuals near the middle. A correction for this,
based on the assumption that the form of distribution of the
abilities in each of the cases is Gaussian, has been calculated
by Prof. K. Pearson. It is

$$r = 2 \sin\left(\frac{\pi}{6}\,\rho\right),$$

where

$$\rho = 1 - \frac{6\,S(d^2)}{N(N^2 - 1)}.$$

At the end of this chapter is given a table whereby $\rho$-values
may be at once converted into corresponding $r$-values, according
to Pearson’s equation.


Finally, there is the question of the “probable error” (P.E.).
Like every other constant calculated from a limited sample of
variable material, the coefficient of correlation varies in value
from sample to sample, and a measure is needed of the limits
within which it may be expected with a fair degree of probability
to lie. This measure is given by the “probable error.”
In the case of $r$ determined by the product-moment formula,
when $N$ is sufficiently large,

$$\text{P.E.} = \frac{0.67449\,(1 - r^2)}{\sqrt{N}},$$

which means that it is an even chance that the true value of
$r$ lies between the limits

$$r \pm \frac{0.67449\,(1 - r^2)}{\sqrt{N}}.$$

The chances are 16 to 1 against the value falling outside
the limits $r \pm 3\,\text{P.E.}$

For $r$ determined by the rank-formula, the probable error is
slightly larger, being

$$\frac{0.7063\,(1 - r^2)}{\sqrt{N}}.$$

If $N$, the number of cases, be small (say, less than 30), the
probable error is larger. Its exact size under such conditions
is not known.
The following is an example of the way in which a correlation
coefficient may be obtained by means of ranks. The
subjects were boys in the Fourth Form of a Public School, and
the correlation to be obtained is that between ability in
Classics and ability in Drawing.

                      Form Order
               Classics   Drawing     d²
R. C. O.           1          9       (1 − 9)² = 64
H. G. M.           2          2         0
B. L.              9         16        49
F. L. S.           7          6         1
C. M. S.           3         15       144
Ceo                5          4         1
iN Np g            6         17       121
E. G. T.           4          3         1
Ce oe              8          5         9
N. P. R. N.       11         14         9
H. B. D.          10         12         4
s.r               14          7        49
H. B. M.          12          1       121
L. H. S.          13          8        25
Jal               15         10        25
E. W.             16         18         4
C. C. M.          17         11        36
L. H. W.          18         13        25
B. M. J.          19         19         0
N = 19                                688 = S(d²)

$$\rho = 1 - \frac{6\,S(d^2)}{N(N^2 - 1)} = 1 - \frac{6 \times 688}{19 \times 360} = 0.40,$$

$$r = 2 \sin\left(\frac{\pi}{6}\,\rho\right) = 0.416,$$

$$\text{P.E.} = \frac{0.7063\,(1 - r^2)}{\sqrt{N}} = 0.134.$$

$r$ is here just over three times its probable error, and we
might therefore feel inclined to conclude that it proves a real
correlation between the two series. We must remember, however,
that 19 is a very small number of cases, and that therefore
the real probable error is considerably larger than that given by
the formula. Hence the reality of the correlation is not so
certain. Our caution is proved to be justified when we turn to
the next higher form, the Remove, and find that, with the same
number of boys, the correlation between Classical ability and
Drawing ability works out as $-0.313$ ($\pm 0.14$), quite a different
result. It might be objected that other factors than mere 
smallness in the number of cases were responsible for the 
difference ; e.g. that the tendency to specialise in Classics was 
greater in the Remove than in the Fourth, and that the 
consequent neglect of Drawing by the abler boys lowered the 
correlation. To this it may be replied, firstly, that the drawing- 
master was the same for both forms, and was likely to get as 
much out of the boys as possible in each case, and, secondly, 
that the difference between the two forms in respect of the 
degree of specialising tendency was insufficient to account for 
the disparity of the results. 
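The working of the Fourth Form example may be retraced by the following sketch in Python, offered as a check on the arithmetic only. The variable names are hypothetical; the two lists are the form orders of the table above, taken in the same sequence of boys.

```python
import math

# Form orders of the nineteen boys, in the sequence of the table.
classics = [1, 2, 9, 7, 3, 5, 6, 4, 8, 11, 10, 14, 12, 13, 15, 16, 17, 18, 19]
drawing = [9, 2, 16, 6, 15, 4, 17, 3, 5, 14, 12, 7, 1, 8, 10, 18, 11, 13, 19]

n = len(classics)
s_d2 = sum((c - d) ** 2 for c, d in zip(classics, drawing))  # S(d^2)
rho = 1 - 6 * s_d2 / (n * (n ** 2 - 1))                      # rank coefficient
r = 2 * math.sin(math.pi * rho / 6)                          # Pearson's correction
probable_error = 0.7063 * (1 - r ** 2) / math.sqrt(n)        # rank-formula P.E.
```

Carried through without rounding, $\rho$ comes to about 0.396 and $r$ to about 0.412; the value 0.416 of the text corresponds to $\rho$ rounded to 0.40 before conversion. The probable error is 0.134 in either case.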


The correct way to compare the results mathematically is
to determine the probable error of their difference. This = the
square root of the sum of the squares of the probable errors of
each, i.e.

$$\text{P.E.}_{a-b} = \sqrt{\text{P.E.}_a^2 + \text{P.E.}_b^2},$$

which, in this case, $= \sqrt{0.13^2 + 0.14^2} = 0.19$.

The difference $= 0.416 + 0.313 = 0.73$,
its probable error $= 0.19$,
so that the difference is nearly four times the size of its probable error.
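The comparison of the two forms may be checked by a few lines of Python; the variable names are hypothetical, and the figures are those quoted above.

```python
import math

pe_fourth, pe_remove = 0.13, 0.14
r_fourth, r_remove = 0.416, -0.313

pe_difference = math.hypot(pe_fourth, pe_remove)  # sqrt(0.13^2 + 0.14^2)
difference = r_fourth - r_remove                  # 0.416 + 0.313
ratio = difference / pe_difference                # difference in units of its P.E.
```

The ratio comes to rather less than four.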


A very important extension of the theory of correlation is 
the conception of “partial” correlation. If, e.g., three mental 
abilities are correlated with one another, it is of interest to 
know how closely any two of them are correlated with one 
another for a constant value of the third. Such a coefficient is 
written, in Yule’s notation, $r_{12.3}$, and measures the closeness of
interrelation of the first two abilities independent of the 
relation between them due to their respective relations to the 
third ability. 

This may be illustrated from our example by taking the
form-order for English into consideration in addition to that for
Classics and for Drawing. The correlation between Classics
and English works out as 0.78, that between Drawing and
English, as 0.21.

Then the correlation between Classics and Drawing for
“English constant”

$$r_{CD.E} = \frac{r_{CD} - r_{CE}\, r_{DE}}{\sqrt{(1 - r_{CE}^2)(1 - r_{DE}^2)}} = \frac{0.42 - 0.78 \times 0.21}{\sqrt{(1 - 0.78^2)(1 - 0.21^2)}} = 0.42.$$
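The partial coefficient for “English constant” may be computed by a short sketch in Python. The function name `partial_r` is hypothetical; the three entire coefficients are those given above, the Classics-Drawing value being taken as 0.42.

```python
import math


def partial_r(r_12, r_13, r_23):
    """Correlation of variates 1 and 2 for a constant value of variate 3."""
    return (r_12 - r_13 * r_23) / math.sqrt((1 - r_13 ** 2) * (1 - r_23 ** 2))


# Classics (1), Drawing (2), English (3).
r_cd_e = partial_r(r_12=0.42, r_13=0.78, r_23=0.21)
```

To two places of decimals the result agrees with the entire coefficient.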


Thus, in this particular case, the “partial” coefficient is
practically identical with the “entire” coefficient, showing that
the relation between Classics and Drawing is practically 
independent of their common relation to English. Calculating 
the other two partial coefficients, we get 


ron.p=18, Yup.c='08. 


The only coefficient showing an appreciable difference 
between entire and partial correlation is that between English 
and Drawing: entire=‘21, partial=°58. This result means 
that that part of ability in English which is not identical with 
ability in Classics is related more closely to that part of ability 
in Drawing which is not identical with ability in Classics than 
is entire English ability to entire Drawing ability. 

The result might be illustrated by the intersection of three
spheres representing Classical, Drawing, and English ability
respectively, after the analogy of Euler’s circles in Logic, the
volume common to any two of the spheres being proportional
to the “entire” correlation coefficient for the two abilities, and
the volume common to the same two exclusive of any part also
common to the third sphere being proportional to the “partial”
correlation coefficient for the two abilities. This cannot, of
course, be taken as any more than an illustration, since the
exact significance of the correlation coefficient, in this sense, is
not known (see below p. 80).
The principle of partial correlation can be extended to
include an indefinite number of variables, and general formulae
for this purpose will be given in the next chapter.

It will be obvious that the same formulae may be employed
for the elimination of “irrelevant” factors, e.g. age, sex, etc.


Table for converting ρ into r ($r = 2\sin\frac{\pi}{6}\rho$)*.


    ρ       r         ρ       r         ρ       r         ρ       r
   .05    .052       .30    .313       .55    .568       .80    .813
   .10    .105       .35    .364       .60    .618       .85    .861
   .15    .157       .40    .416       .65    .668       .90    .908
   .20    .209       .45    .467       .70    .717       .95    .954
   .25    .261       .50    .518       .75    .765      1.00   1.000


* Quoted from K. Pearson, Drapers’ Company Research Memoirs, Biometric
Series IV. 1907, p. 18.
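The entries of the table follow directly from the formula in its heading, and may be regenerated as a check. This is a sketch only; the names `rho_to_r` and `table` are illustrative and do not occur in the original.

```python
import math

def rho_to_r(rho):
    """Convert the rank coefficient rho into r by r = 2 sin(pi * rho / 6)."""
    return 2 * math.sin(math.pi * rho / 6)

# Rebuild the table for rho = .05, .10, ... 1.00, rounded to three places.
table = {round(0.05 * i, 2): round(rho_to_r(0.05 * i), 3) for i in range(1, 21)}

for rho in sorted(table):
    print(f"{rho:.2f}  {table[rho]:.3f}")
```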


CHAPTER II*


THE MATHEMATICAL THEORY OF CORRELATION 


In the present chapter an attempt will be made to summarize 
briefly the principal methods in use for obtaining a measure of 
the correlation, or tendency towards concomitant variation, of 
two or more variates. 


Let the coordinates of the dots in the accompanying diagram
(commonly known as a “scatter diagram”) represent the
measures of two separate characteristics, e.g. speed of adding
figures (x) and accuracy of adding figures (y), in a number of
individuals (N).

[Fig. 11. Scatter diagram: speed of adding (x) on the horizontal axis, accuracy of adding (y) on the vertical axis.]

Let the crosses represent the mean values of y corresponding
to values of x lying between the limits of pairs of successive units
of measurement. Then the broken curve CC′ passing through


* The proofs of the formulae for r, η, and the “two-row table” to be found
in this chapter are abbreviations of those given by Professor Karl Pearson in
his lectures.
these crosses represents the most probable law of relationship
between speed of adding and accuracy of adding, and is known 
as the regression curve. 


I. Correlation coefficient (r).

Let us now find the “best fitting” straight line, LL′, to
this curve. To do so we apply the method of least squares,
merely from motives of convenience; i.e. we choose a line such
that the sum of the squares of all distances like PM, each
weighted with the number of cases ($n_x$) from which the mean
value of the corresponding array of y’s was determined, is a
minimum.

Let the ordinate of P be Y and the ordinate of M be $\bar{y}_x$ (the mean
of an array).

Then $S\{n_x (Y - \bar{y}_x)^2\} = V$ (say), which is to be a minimum.

Let the equation to the line LL′ be $Y = Ax + B$.

Then
$$V = S\{n_x (Ax + B - \bar{y}_x)^2\} \qquad (\alpha).$$
For V a minimum, this gives*
$$A = \frac{\dfrac{S(xy)}{N} - \bar{x}\bar{y}}{\sigma_1^2} = \frac{S(x - \bar{x})(y - \bar{y})}{N\sigma_1^2},$$
and also
$$\bar{y} = A\bar{x} + B$$

($\bar{x}$, $\bar{y}$ being the mean values of the x’s and y’s respectively, and
$\sigma_1$, $\sigma_2$ their standard deviations).


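* $V = S\{n_x(Ax + B - \bar{y}_x)^2\} = A^2\,S(n_x x^2) + B^2\,S(n_x) + S(n_x \bar{y}_x^2) + 2AB\,S(n_x x) - 2A\,S(n_x x \bar{y}_x) - 2B\,S(n_x \bar{y}_x)$,

$$\frac{V}{N} = A^2(\sigma_1^2 + \bar{x}^2) + B^2 + \frac{S(n_x \bar{y}_x^2)}{N} + 2AB\bar{x} - \frac{2A\,S(n_x x \bar{y}_x)}{N} - 2B\bar{y}.$$

For this to be a minimum, $\frac{d}{dA}\left(\frac{V}{N}\right)$ and $\frac{d}{dB}\left(\frac{V}{N}\right)$ must both be zero, and
$\frac{d^2}{dA^2}\left(\frac{V}{N}\right)$, $\frac{d^2}{dB^2}\left(\frac{V}{N}\right)$ must both be positive.
$$A(\sigma_1^2 + \bar{x}^2) + B\bar{x} - \frac{S(n_x x \bar{y}_x)}{N} = 0 \qquad (1),$$
$$A\bar{x} + B - \bar{y} = 0 \qquad (2).$$
Solving these equations for A and B, we have
$$A = \frac{\dfrac{S(n_x x \bar{y}_x)}{N} - \bar{x}\bar{y}}{\sigma_1^2} = \frac{\dfrac{S(xy)}{N} - \bar{x}\bar{y}}{\sigma_1^2} = \frac{S(x - \bar{x})(y - \bar{y})}{N\sigma_1^2},$$
and $B = \bar{y} - A\bar{x}$.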
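Therefore the line passes through the means of the two
values, and its equation is
$$y - \bar{y} = A(x - \bar{x}).$$

Let us define r as
$$r = \frac{S(x - \bar{x})(y - \bar{y})}{N\sigma_1\sigma_2}.$$
Then
$$A = \frac{S(x - \bar{x})(y - \bar{y})}{N\sigma_1^2} = r\,\frac{\sigma_2}{\sigma_1},$$
and the equation to the best-fitting straight line, LL′, is
$$y - \bar{y} = r\,\frac{\sigma_2}{\sigma_1}(x - \bar{x}) \qquad (\beta).$$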


LL′ is known as the regression line, and $r\,\dfrac{\sigma_2}{\sigma_1}$ is called the
coefficient of regression of y on x, being the tangent of the angle
which this line makes with the axis of x.


Let y − Y* measure the distance of any individual point from
this line. Then the average of the sum of the squares of all
such distances
$$= \frac{S(y - Y)^2}{N} = \frac{S\{y - \bar{y} - A(x - \bar{x})\}^2}{N} = \sigma_2^2 + A^2\sigma_1^2 - 2Ar\sigma_1\sigma_2 = \sigma_2^2(1 - r^2).$$

Hence the standard error (standard deviation) made in
estimating the value of y most probably associated with any
particular value of x from equation (β) is
$$\sigma_2\sqrt{1 - r^2} \qquad (\gamma).$$


r is known as the coefficient of correlation, and evidently must
lie between the values +1 and −1. If the regression line
coincides with the regression curve, within the limits of errors
of random sampling (in other words, if the regression is linear),
r is a measure of the degree of dependence between x and y.
When r = ±1, the points close up upon the line and the
“scatter diagram” contracts to become the line itself.


* Y is the y of equation (β), and therefore $= \bar{y} + A(x - \bar{x})$.

If x and y be measured from their mean values, instead of
from zero or some other arbitrary origin, we get
$$r = \frac{S(xy)}{N\sigma_1\sigma_2},$$
and the regression of y on x is given by the equation
$$y = r\,\frac{\sigma_2}{\sigma_1}\,x.$$
An analogous equation, $x = r\,\dfrac{\sigma_1}{\sigma_2}\,y$, gives the regression of x on
y. There are thus two regression lines. The coefficients of
regression, $r\,\dfrac{\sigma_2}{\sigma_1}$ and $r\,\dfrac{\sigma_1}{\sigma_2}$, are conveniently denoted by the symbols
$b_{21}$ and $b_{12}$ respectively.

If x and y be measured from their means and also in terms
of their respective standard deviations as unity, the regression
equations become
$$x = ry \quad \text{and} \quad y = rx,$$
and r is then itself the coefficient of regression of y on x and of
x on y, the two regressions being equal.
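The results of this section may be verified numerically on any small set of paired measures. The sketch below is offered under stated assumptions: the function name `correlation_summary`, its dictionary keys, and the six pairs of figures are all hypothetical and serve only as illustration. It computes r, the two regression coefficients, and the root-mean-square distance of the points from the regression line of y on x, which should agree with the standard deviation of y multiplied by the square root of (1 − r²).

```python
import math

def correlation_summary(xs, ys):
    """Product-moment r, both regression coefficients, and the r.m.s. residual of y on x."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sd_x = math.sqrt(sum((x - mx) ** 2 for x in xs) / n)
    sd_y = math.sqrt(sum((y - my) ** 2 for y in ys) / n)
    r = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / (n * sd_x * sd_y)
    b_yx = r * sd_y / sd_x          # regression of y on x
    b_xy = r * sd_x / sd_y          # regression of x on y
    # root-mean-square distance of the points from the regression line of y on x
    rms = math.sqrt(sum((y - my - b_yx * (x - mx)) ** 2
                        for x, y in zip(xs, ys)) / n)
    return {"r": r, "b_yx": b_yx, "b_xy": b_xy, "rms_residual": rms, "sd_y": sd_y}

# Hypothetical measures, for illustration only.
xs = [1, 2, 3, 4, 5, 6]
ys = [2, 1, 4, 3, 6, 5]
out = correlation_summary(xs, ys)
print(out)
```

The product of the two regression coefficients returned is r², as follows at once from their definitions.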


II. Correlation Ratio (η)*.


It is clear that if the regression is not linear r ceases to be 
a satisfactory measure of the relation between the two characters 
under consideration. In an extreme case, such as that shown 


[Fig. 12. Scatter diagram.]


* See Drapers’ Company Research Memoirs, Biometric Series II. pp. 9 et seq.
Karl Pearson, “On the Theory of Skew-Correlation and Non-Linear Regression.”
in the accompanying diagram, r may be zero while there is yet
a very close relation between the two characters.


Let $y - \bar{y}_x$ be the distance of any point of the “scatter
diagram” from the regression curve. Then the average of the
sum of the squares of all such distances, each x-array of y’s
being weighted with the number of individuals ($n_x$) in it,
$$= \frac{S\{n_x (y - \bar{y}_x)^2\}}{N},$$
and this expression, on reduction,
$$= \sigma_2^2 - \Sigma^2,$$
where Σ = the standard deviation of the means of the arrays,
each array being weighted with the number in it, i.e.
$$\Sigma^2 = \frac{S\{n_x (\bar{y}_x - \bar{y})^2\}}{N}.$$

Let $\eta = \dfrac{\Sigma}{\sigma_2}$.

Then
$$\frac{S\{n_x (y - \bar{y}_x)^2\}}{N} = \sigma_2^2(1 - \eta^2),$$
and the root mean square of the distance of points from the regression
curve
$$= \sigma_2\sqrt{1 - \eta^2} \qquad (\epsilon).$$

Comparing expressions (ε) and (γ), we see that η is a test of
the linearity of the regression. If η = r, within the limits of
errors of random sampling, the regression is linear.

Since η is the ratio of two standard deviations, it must
always be positive, and from (ε) we see that it cannot be greater
than 1.


Let Y be the ordinate of any point on the regression line.
Then the average of the sum of the weighted squares of the
distances between the regression line and the regression curve
$$= \frac{S\{n_x (\bar{y}_x - Y)^2\}}{N},$$
which reduces to
$$\sigma_2^2(\eta^2 - r^2).$$
Thus η must always be numerically greater than r, except
in the case of linear regression, when it is numerically equal
to r.


In examining the relationship between two measurable
characters, η should be calculated as well as r, since it serves as
a test of the linearity or non-linearity of the regression, and is
also a better measure of causal relation than r.


A simple criterion for linearity which is very generally
applicable is that
$$\frac{\sqrt{N}}{.67449}\cdot\tfrac{1}{2}\sqrt{\eta^2 - r^2} < 2.5\,\text{*}.$$


For very exact work, more complicated formulae need to be 


employed. 
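The calculation of η, and the comparison of it with r, may be sketched as follows. Several things here are assumptions and should be read as such: the function names are illustrative; the arrays are taken to be the groups of y’s sharing one value of x; the linearity criterion is coded in the form $\frac{\sqrt{N}}{.67449}\cdot\frac{1}{2}\sqrt{\eta^2 - r^2} < 2.5$; and the data are hypothetical, chosen so that y depends wholly upon x although r vanishes.

```python
import math
from collections import defaultdict

def pearson_r(xs, ys):
    """Product-moment correlation coefficient."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

def correlation_ratio(xs, ys):
    """eta of y on x: S.D. of the array means (weighted by n_x) over the S.D. of y."""
    n = len(ys)
    my = sum(ys) / n
    var_y = sum((y - my) ** 2 for y in ys) / n
    arrays = defaultdict(list)
    for x, y in zip(xs, ys):
        arrays[x].append(y)          # one array of y's for each value of x
    var_means = sum(len(a) * (sum(a) / len(a) - my) ** 2
                    for a in arrays.values()) / n
    return math.sqrt(var_means / var_y)

def linearity_statistic(n, eta, r):
    """Left-hand side of the linearity criterion; regression treated as linear if < 2.5."""
    return math.sqrt(n) / 0.67449 * 0.5 * math.sqrt(max(eta ** 2 - r ** 2, 0.0))

# Hypothetical data: y = x squared on a symmetrical range of x.
xs = [-2, -1, 0, 1, 2] * 4
ys = [x * x for x in xs]

r = pearson_r(xs, ys)
eta = correlation_ratio(xs, ys)
stat = linearity_statistic(len(xs), eta, r)
print(r, eta, stat)
```

In this extreme case r is zero while η is unity, and the statistic exceeds 2.5, so that the regression would be judged non-linear.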


The results obtained above are all independent of the forms 
of distribution of the variates. 


In Appendix II. will be found an example of the evaluation
of r and of η between speed of adding single digits and accuracy
in doing so, the individuals measured being 86 boys between
the ages of 11 and 12 years from two L.C.C. elementary schools.
The two groups could be thrown together for this purpose, since
the means and standard deviations calculated from them
separately were in very close agreement, well within the
limits of the probable errors. The results will be discussed in
Chapter III.


III. Probable Errors.


In determining means, standard deviations, and other fre- 
quency constants, the investigator is unable to work from the 
“total population” and must be content with the results 
obtained from “random samples” of greater or less size taken 
from this (in some cases, hypothetical) total population. Evidently 


* J. Blakeman: Biometrika, Vol. IV. pp. 349, 350.
the results obtained from different random samples will not as
a rule coincide, and some measure is needed of the variability 
of these results about their average, which latter quantity, for 
a large number of samples, may be regarded as coinciding 
approximately with the value of the frequency constant for the 
total population. Such a measure is given by the standard 
deviation, o, of the various results obtained from the different 


samples, but, since the theory of probable errors was originally
developed on the assumption of the “normal” or Gaussian form
of distribution, the value universally taken is .67449 × σ, which
is conventionally termed the probable error (P.E.). In the
normal curve (see Fig. 13) it is the distance OP, the position of
P being such that the ordinate PP′ divides the positive half
of the curve into two equal areas. Thus, if the total area
of the curve is taken as unity, the value of P.E. is given by the
equation
$$\frac{1}{\sigma\sqrt{2\pi}}\int_0^{P.E.} e^{-\frac{x^2}{2\sigma^2}}\,dx = \tfrac{1}{4},$$
whence P.E. = .67449σ.

In the language of chance, a deviation of ± P.E. is as likely
to occur as not, when the distribution of frequencies is “normal.”
The value of a frequency constant determined from a random
sample may be considered fairly reliable within the limits of
± (2 to 3 times its P.E.).
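The constant .67449 may be recovered from the normal curve itself, being the deviate which cuts off one quarter of the whole area on either side of the mean. A brief check, using the `NormalDist` class of the Python standard library (the names `PE_FACTOR` and `central_area` are illustrative only):

```python
from statistics import NormalDist

unit_normal = NormalDist(mu=0.0, sigma=1.0)

# The probable error is the deviation exceeded, in either direction, half the time:
# the point below which three quarters of the unit normal curve lies.
PE_FACTOR = unit_normal.inv_cdf(0.75)

# Area of the curve lying between -P.E. and +P.E.
central_area = unit_normal.cdf(PE_FACTOR) - unit_normal.cdf(-PE_FACTOR)

print(round(PE_FACTOR, 5), round(central_area, 5))
```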
When the number of cases (n) in the random sample is
fairly large (so large that fractions containing certain higher
powers of n in the denominator can be neglected) the probable
errors are found to be as follows*:

P.E. of a mean $= .67449\,\dfrac{\sigma}{\sqrt{n}}$,

P.E. of σ $= .67449\,\dfrac{\sigma}{\sqrt{2n}}$,

P.E. of r $= .67449\,\dfrac{1 - r^2}{\sqrt{n}}$.

The second and third of these values are only correct when
the frequency distribution is normal or approximately normal.
In particular, for large values of r the true P.E. may be considerably
different from that given by the above formula unless
the distribution is normal.

P.E. of η $= .67449\,\dfrac{1 - \eta^2}{\sqrt{n}}$, for linear regression, and also, as a
rough measure, for cases of skew-correlation. If greater exactitude
is needed in the latter cases, more complicated formulae
have to be employed†.


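Another frequency constant in common use is the coefficient
of variation, $V = \dfrac{100\,\sigma}{\text{mean}}$.

Its P.E. $= .67449\,\dfrac{V}{\sqrt{2n}}\left\{1 + 2\left(\dfrac{V}{100}\right)^2\right\}^{\frac{1}{2}}$‡.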


As stated above, the values just given for the probable 
errors only apply in cases where n is fairly large. In cases 
where n is so small that certain higher powers of its reciprocal 
cannot be neglected in comparison with the rest of the expres- 
sions involving them, the values cannot be used. For such 
cases no theoretical formulae have hitherto been devised. 
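For convenience of computation the large-sample formulae of this section may be gathered into a few small functions. This is a sketch under assumptions: the function names are illustrative, and the expression for the coefficient of variation is coded in the form $.67449\,\frac{V}{\sqrt{2n}}\{1 + 2(V/100)^2\}^{1/2}$. None of these functions should be applied to small samples, for the reasons just stated.

```python
import math

PE = 0.67449  # ratio of the probable error to the standard deviation (normal curve)

def pe_mean(sd, n):
    """Probable error of a mean."""
    return PE * sd / math.sqrt(n)

def pe_sd(sd, n):
    """Probable error of a standard deviation (normal distribution assumed)."""
    return PE * sd / math.sqrt(2 * n)

def pe_r(r, n):
    """Probable error of a correlation coefficient (normal distribution assumed)."""
    return PE * (1 - r ** 2) / math.sqrt(n)

def pe_cv(v, n):
    """Probable error of a coefficient of variation V (expressed as a percentage)."""
    return PE * v / math.sqrt(2 * n) * math.sqrt(1 + 2 * (v / 100) ** 2)

print(round(pe_r(0.66, 30), 4))
```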

* See W. Gibson: “Tables for Facilitating the Computation of Probable
Errors,” Biometrika, Vol. IV. pp. 385 et seq.
† Karl Pearson: op. cit. (Biometric Series II.) p. 19.


‡ Calculated values for different values of n given in Gibson’s Tables, just
quoted.




Quite recently, however, an empirical investigation has been
made on samples of 4, 8, and 30 cases, taken from a “total
population” of 3000 pairs of measurements (height and left
middle finger measurements of 3000 criminals; “real” correlation,
.66)*. In the case of correlation it was found that when the
original pairs of measurements were arranged so that the “real”
correlation was 0, the distribution of the values of r obtained
from samples of n was given approximately by the formula
$$y = y_0\,(1 - r^2)^{\frac{n-4}{2}}.$$


Five distributions in all were worked out, giving the following
moment coefficients:


                          Mean     S.D.     μ₂        μ₃          μ₄         β₁            β₂
Samples of  4 (r = 0)       -      .5512    .3038       -         .1768     [illegible]   [illegible]
   "        8 (r = 0)       -      .3731    .1392       -         .0454     [illegible]   [illegible]
   "        4 (r = .66)   .5609    .4680    .2190    -.1570       .2152     [illegible]   [illegible]
   "        8 (r = .66)   .6139    .2684    .07202   -.02634      .02714     1.857         5.232
   "       30 (r = .66)   .661     .1001    .01003   -.000882     .000461     .7713        4.580


Unit of grouping for 4’s and 8’s = .04.


Correlation results, for real value of r = .66, were

samples of  4 ...... .5609 ± .011,
   "        8 ...... .614 ± .065,
   "       30 ...... .6609 ± .0067.


Hence it may be concluded that, although in the case of 
such small samples as 4 or 8 the ordinary formula for the P.E. 
of r gives much too low a value, yet in the case of as many as 
30, the formula applies with tolerable accuracy. We must, 
however, bear in mind that this result has only been proved 
(empirically) to hold in the single case where the actual 
correlation was .66.
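An experiment of the same kind may be imitated by artificial sampling. The sketch below is not a reproduction of “Student’s” investigation: it draws its samples from an ideal normal correlation surface with correlation .66 instead of from the 3000 measured pairs, and the function names, the seed, and the number of trials are arbitrary assumptions. It compares the observed standard deviation of r in samples of 4, 8, and 30 with the value $(1 - r^2)/\sqrt{n}$ implied by the ordinary formula.

```python
import math
import random

def pearson_r(xs, ys):
    """Product-moment correlation coefficient."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

def sd_of_sample_r(rho, n, trials, rng):
    """S.D. of r over repeated samples of n pairs from a normal surface with correlation rho."""
    rs = []
    for _ in range(trials):
        xs, ys = [], []
        for _ in range(n):
            u, v = rng.gauss(0, 1), rng.gauss(0, 1)
            xs.append(u)
            ys.append(rho * u + math.sqrt(1 - rho ** 2) * v)
        rs.append(pearson_r(xs, ys))
    mean_r = sum(rs) / trials
    return math.sqrt(sum((r - mean_r) ** 2 for r in rs) / trials)

rng = random.Random(1911)          # fixed seed so that the run can be repeated
results = {}
for n in (4, 8, 30):
    observed = sd_of_sample_r(0.66, n, 2000, rng)
    formula = (1 - 0.66 ** 2) / math.sqrt(n)
    results[n] = (observed, formula)
    print(n, round(observed, 3), round(formula, 3))
```

With samples of 4 the observed scatter of r is much greater than the formula allows, while with samples of 30 the two are in fair agreement, which accords with the conclusion drawn above.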


* “Student”: “The Probable Error of a Coefficient of Correlation,” Biometrika,
Vol. VI. pp. 302-310, 1908-1909.
IV. Multiple Correlation*.


The regression equation for the two variates, x and y, both
measured from their means, was found, on p. 57, to be of the form
$$x_1 = b_{12}\,x_2,$$
the equation being obtained by an application of the method of
least squares, on the assumption that the regression was linear.


It is evident that the method thus applied to the case of
two variates might, without further assumptions, be used in
the general case of n variates. Thus, if $x_1, x_2, \ldots x_n$ denote
deviations from means, the equation expressing the regression
of $x_1$ on $x_2, \ldots x_n$ could be written
$$x_1 = b_{12.34\ldots n}\,x_2 + b_{13.24\ldots n}\,x_3 + \ldots + b_{1n.23\ldots n-1}\,x_n \qquad (\alpha).$$
“In this notation the suffix of each regression coefficient
completely defines it. The first subscript gives the dependent
variable, the second the variable of which the given regression
is the coefficient, and the subscripts after the period show the
remaining independent variables which enter into the equation.”
(Yule, op. cit. p. 182.)


In terms of the same notation, write
$$r_{12.34\ldots n} = (b_{12.34\ldots n}\cdot b_{21.34\ldots n})^{\frac{1}{2}} \qquad (\beta),$$
$$x_{1.23\ldots n} = x_1 - (b_{12.34\ldots n}\,x_2 + b_{13.24\ldots n}\,x_3 + \ldots + b_{1n.23\ldots n-1}\,x_n) \qquad (\gamma),$$
$$N\sigma^2_{1.23\ldots n} = S(x^2_{1.23\ldots n}) \qquad (\delta).$$


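Then the normal equations, from which the regressions are
determined by the method of least squares, may be written
$$S(x_2\cdot x_{1.23\ldots n}) = S(x_3\cdot x_{1.23\ldots n}) = \ldots = S(x_n\cdot x_{1.23\ldots n}) = 0 \qquad (\epsilon).$$
From these equations Mr Yule has deduced the following
results:
$$b_{12.34\ldots n} = r_{12.34\ldots n}\,\frac{\sigma_{1.34\ldots n}}{\sigma_{2.34\ldots n}} \qquad (\zeta),$$
$$\sigma^2_{1.23\ldots n} = \sigma_1^2\,(1 - r_{12}^2)(1 - r_{13.2}^2)(1 - r_{14.23}^2)\ldots(1 - r_{1n.23\ldots n-1}^2) \qquad (\eta),$$
$$r_{12.34\ldots n} = \frac{r_{12.34\ldots n-1} - r_{1n.34\ldots n-1}\,r_{2n.34\ldots n-1}}{(1 - r_{1n.34\ldots n-1}^2)^{\frac{1}{2}}\,(1 - r_{2n.34\ldots n-1}^2)^{\frac{1}{2}}} \qquad (\theta).$$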


* See G. Udny Yule: “On the Theory of Correlation for any Number of
Variables, treated by a New System of Notation,” Proc. Roy. Soc. Vol. 79 A,
pp. 182-193, 1907, of which the present section is merely a much condensed
account.
$r_{12.34\ldots n}$ is known as a “partial” correlation coefficient, being the
value of the correlation between 1 and 2 for constant values of
3, 4, … n. Similarly, $b_{12.34\ldots n}$ is a “partial” regression coefficient
(cf. “partial differentiation” in the Differential Calculus).
Knowing the “total” correlations $r_{12}$, $r_{13}$, $r_{23}$, etc., equation (θ)
enables us to obtain the various partial coefficients by successive
substitutions. Thus, in the case of 3 variables, 1, 2, 3, we have
$$r_{12.3} = \frac{r_{12} - r_{13}\,r_{23}}{\sqrt{1 - r_{13}^2}\,\sqrt{1 - r_{23}^2}},$$
and two similar equations, expressing the value of the correlation
between two of the variables for a constant value of the
third*. If a fourth variable be added, we have the further
set of equations
$$r_{12.34} = \frac{r_{12.3} - r_{14.3}\,r_{24.3}}{\sqrt{1 - r_{14.3}^2}\,\sqrt{1 - r_{24.3}^2}}, \text{ etc.}$$
Perhaps a more convenient formula for obtaining the partial
correlation of two variables for constant values of a third and
fourth is
$$r_{12.34} = \frac{r_{12}(1 - r_{34}^2) - r_{13}(r_{23} - r_{24}r_{34}) - r_{14}(r_{24} - r_{23}r_{34})}{\sqrt{1 - r_{13}^2 - r_{14}^2 - r_{34}^2 + 2r_{13}r_{14}r_{34}}\;\sqrt{1 - r_{23}^2 - r_{24}^2 - r_{34}^2 + 2r_{23}r_{24}r_{34}}}.$$


When the partial correlation coefficients have been determined,
the regressions can be found by substituting in equations
(η) and (ζ), and these give at once the regression equations.
As explained before, a regression equation gives the most
probable value of one variable for given values of the remaining
variables, the standard error in such a prediction being $\sigma_{1.23\ldots n}$,
etc.


An example of the method of applying the above formulae 
is given in Appendix III., where the partial correlations, regres-
sions, etc. in the case of four interrelated psychical capacities 
are worked out on lines identical with those illustrated by 
Mr Yule in his paper. 
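The method of successive substitution lends itself readily to mechanical computation. In the sketch below the function names, the manner of storing the total correlations, and the six numerical values are all hypothetical; the recursion removes one held-constant variable at a time in the manner of equation (θ), and its result for four variables is compared with the direct formula for $r_{12.34}$ given above.

```python
import math

def partial_r(total, i, j, held):
    """r_{ij.held}, built up by successive substitution from the total correlations.

    `total` maps frozenset({a, b}) to the total correlation r_ab;
    `held` is a tuple of the variables kept constant.
    """
    if not held:
        return total[frozenset((i, j))]
    k, rest = held[-1], held[:-1]
    r_ij = partial_r(total, i, j, rest)
    r_ik = partial_r(total, i, k, rest)
    r_jk = partial_r(total, j, k, rest)
    return (r_ij - r_ik * r_jk) / math.sqrt((1 - r_ik ** 2) * (1 - r_jk ** 2))

def r12_34_direct(r12, r13, r14, r23, r24, r34):
    """Direct formula for the correlation of 1 and 2 with 3 and 4 constant."""
    num = r12 * (1 - r34 ** 2) - r13 * (r23 - r24 * r34) - r14 * (r24 - r23 * r34)
    den = math.sqrt((1 - r13 ** 2 - r14 ** 2 - r34 ** 2 + 2 * r13 * r14 * r34)
                    * (1 - r23 ** 2 - r24 ** 2 - r34 ** 2 + 2 * r23 * r24 * r34))
    return num / den

# Hypothetical total correlations between four capacities, for illustration only.
pairs = {(1, 2): 0.5, (1, 3): 0.4, (1, 4): 0.3,
         (2, 3): 0.6, (2, 4): 0.2, (3, 4): 0.1}
total = {frozenset(k): v for k, v in pairs.items()}

by_substitution = partial_r(total, 1, 2, (3, 4))
by_direct_formula = r12_34_direct(0.5, 0.4, 0.3, 0.6, 0.2, 0.1)
print(round(by_substitution, 4), round(by_direct_formula, 4))
```

The two routes agree, as they must; the recursive form has the advantage of extending without change to any number of variables held constant.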


* For an interesting representation of correlation between three variables 
by a model showing the distribution of points in space, see G. Udny Yule, ‘‘An 
Introduction to the Theory of Statistics,” C. Griffin and Co., London, 1911, 
pp. 241-243. 
The probable error of a partial correlation coefficient $r_{12.34\ldots n}$
$$= .67449\,\frac{1 - r_{12.34\ldots n}^2}{\sqrt{n}},$$
i.e. is similar in form to that for a “total” coefficient.


V. Other methods for determining Correlation. 


1. Four-fold Table. 


(a) When the divisions pass through the means of both
characters
$$r = \sin\left(\frac{\pi}{2}\cdot\frac{a - b}{a + b}\right).$$
This formula (Sheppard’s) is of little practical use, since the
mean values, in cases where the four-fold table is the only
method which can be used, are generally unknown.


(b) In cases where the dividing lines do not pass through
the means (means not known), the equations for determining
the correlation are*:
$$\frac{(a + c) - (b + d)}{N} = \sqrt{\frac{2}{\pi}}\int_0^h e^{-\frac{1}{2}x^2}\,dx,$$
$$\frac{(a + b) - (c + d)}{N} = \sqrt{\frac{2}{\pi}}\int_0^k e^{-\frac{1}{2}y^2}\,dy,$$


* All this is rendered much easier by P. F. Everitt’s Tables of Tetrachoric
Functions just published in Biometrika (Vol. VII. pp. 437-451).
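from which h and k can be obtained by Sheppard’s Tables; and
finally
$$\begin{aligned}
\frac{ad - bc}{N^2 HK} = r &+ \frac{r^2}{2}\,hk + \frac{r^3}{6}\,(h^2 - 1)(k^2 - 1) + \frac{r^4}{24}\,h(h^2 - 3)\,k(k^2 - 3) \\
&+ \frac{r^5}{120}\,(h^4 - 6h^2 + 3)(k^4 - 6k^2 + 3) \\
&+ \frac{r^6}{720}\,h(h^4 - 10h^2 + 15)\,k(k^4 - 10k^2 + 15) \\
&+ \frac{r^7}{5040}\,(h^6 - 15h^4 + 45h^2 - 15)(k^6 - 15k^4 + 45k^2 - 15) \\
&+ \ldots
\end{aligned}$$
where $H = \dfrac{1}{\sqrt{2\pi}}\,e^{-\frac{1}{2}h^2}$ and $K = \dfrac{1}{\sqrt{2\pi}}\,e^{-\frac{1}{2}k^2}$.


In obtaining these equations, a normal correlation surface
is assumed; equation:
$$z = \frac{N}{2\pi\sigma_1\sigma_2\sqrt{1 - r^2}}\,\exp\left\{-\frac{1}{2(1 - r^2)}\left(\frac{x^2}{\sigma_1^2} - \frac{2rxy}{\sigma_1\sigma_2} + \frac{y^2}{\sigma_2^2}\right)\right\}.$$

The probable error of r obtained by the four-fold table
method is much larger than that given by the formula of p. 61.
The correct formula is too complicated to insert here.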

2. Method of Contingency*. 

The following is an example of a contingency table†.

Fathers. 


4 = 


Merry | Melancholy | Alternating | Even | Totals 


1 MerEg 5.8 122 8 278 
“% | Melancholy 10 2 29 
sy | Alternating 70 9 248 
Even... 58 6 175 
Totals «4. 260 25 


* Karl Pearson: “On the Theory of Contingency and its Relation to 
Association and Normal Correlation,” Drapers’ Company Research Memoirs, 
Biometric Series 1. 1904; Dulan and Co., London. 

+ Taken from a paper by E. Schuster and E. M. Elderton on ‘‘The Inheritance 
of Psychical Characters (being a further Statistical Treatment of Material 
Collected and Analysed by Messrs G. Heymans and E. Wiersma),” Biometrika, 
Vol. v. 1906-1907, pp. 460-469. 
The method is employed when the grouping is merely by
class, and the different classes have no quantitative relation to 
one another,—in other words, when the grouping is merely 
qualitative. The relationship between the two variables is 
measured by the differences between the numbers actually 
found in the various compartments of the table and the 
numbers that might be expected there by pure chance. 


To state the rule: 


The total mean square contingency, φ², of the table is given
by
$$\phi^2 = S\left\{\frac{\left(n_{pq} - \dfrac{n_p n_q}{N}\right)^2}{n_p n_q}\right\},$$
where
$n_p$ = total frequency in pth row,
$n_q$ = total frequency in qth column,
$n_{pq}$ = frequency of constituent common to pth row and qth column,
N = total frequency of table.


Then the coefficient of mean-square contingency,
$$C = \sqrt{\frac{\phi^2}{1 + \phi^2}}\,\text{*}.$$
In the above case of fathers and sons 
6= 016; 
The probable error of ¢ is very complicated ft. 
For ¢= 0, P.E. = Ss 
VN 
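The rule for the mean square contingency may be put into computational form as follows. This is a sketch under assumptions: the function names are illustrative, the two small tables are hypothetical (they are not the table of fathers and sons given above), every row and column total is supposed greater than zero, and none of the corrections mentioned in the footnote is applied.

```python
import math

def mean_square_contingency(table):
    """phi squared: S{ (n_pq - n_p n_q / N)^2 / (n_p n_q) } over all compartments."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    phi2 = 0.0
    for p, row in enumerate(table):
        for q, n_pq in enumerate(row):
            expected = row_totals[p] * col_totals[q] / n   # number expected by pure chance
            phi2 += (n_pq - expected) ** 2 / (row_totals[p] * col_totals[q])
    return phi2

def contingency_coefficient(table):
    """Coefficient of mean-square contingency, C = sqrt(phi^2 / (1 + phi^2))."""
    phi2 = mean_square_contingency(table)
    return math.sqrt(phi2 / (1 + phi2))

independent = [[10, 20], [20, 40]]   # hypothetical: frequencies exactly as chance would give
diagonal = [[10, 0], [0, 10]]        # hypothetical: complete association

print(contingency_coefficient(independent), contingency_coefficient(diagonal))
```

It will be noticed that even complete association in a table of two rows and two columns gives C no higher than about .71; the upper limit of C depends upon the number of compartments.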


3. Two-row Table. (New method‡.)


This method, recently devised by Professor K. Pearson, 
gives a unique value of r in the case of two variates one of 
which is both quantitative and continuous (e.g. intelligence) 


* There are certain corrections on φ², not mentioned here, which often make
much difference to the result.

† J. Blakeman and Karl Pearson: “On the Probable Error of Mean-Square
Contingency,” Biometrika, Vol. V. pp. 191-197, 1906.

‡ Karl Pearson: Biometrika, Vol. VII. 1909, p. 97.
while the other, though quantitative, admits of only two
subdivisions (e.g. into good and bad visualisers), or, in more 
technical language, is “alternative.” 


[Two-row table: x (intelligence) across the top; rows: good visualisers, bad visualisers.]


The assumptions made are two in number: 
(1) that the regression is linear: 


(2) that the distribution of the alternative variate is 
approximately normal or Gaussian. 


The accompanying diagram (Fig. 14) will make the method
clearer. LL′ is the regression line of x on y, its equation being
$$x - \bar{x} = r\,\frac{\sigma_x}{\sigma_y}\,(y - \bar{y}).$$

Putting $\bar{x}$, the mean of all the x’s, $= p$, $x = p'$, the mean
value of the array of x’s corresponding to the smaller group
of y’s (good visualisers), and $y - \bar{y} = q$, the distance of the
centroid of the area RST from the mean MM of the Gaussian
curve, we have the formula:
$$r = \frac{(p' - p)/\sigma_x}{q/\sigma_y}.$$
The numerator of this is known, and the denominator
$$\frac{q}{\sigma_y} = \frac{\dfrac{1}{\sqrt{2\pi}}\displaystyle\int_{x_0}^{\infty} x\,e^{-\frac{1}{2}x^2}\,dx}{\text{area of part } RST} = \frac{N}{n_1}\cdot\frac{1}{\sqrt{2\pi}}\,e^{-\frac{1}{2}x_0^2},$$
where $n_1$ = no. of cases in upper group (area RST),
and N = total no. of cases.


[Fig. 14. Gaussian curve of the alternative variate, divided into good visualisers (area RST) and bad visualisers.]


In this expression $\dfrac{n_1}{N} = \tfrac{1}{2}(1 - \alpha)$ in Sheppard’s Tables
(Biometrika, Vol. II. p. 182) from which the corresponding
value of $x_0$ can be read; and the value of z corresponding to this
latter result gives $\dfrac{1}{\sqrt{2\pi}}\,e^{-\frac{1}{2}x_0^2}$, i.e.
$$\frac{n_1}{N}\cdot\frac{q}{\sigma_y} = \text{value of } z \text{ in Sheppard’s Tables corresponding to the } \tfrac{1}{2}(1 - \alpha) \text{ of } \frac{n_1}{N}.$$


The above method has not yet been applied in psychology,
but might advantageously be used in determining the relationship
of two characters, such as those suggested (intelligence and
visual imagery), which are both quantitative, but of which one
only admits of reliable division into two groups. It is the
smaller of these groups (the “tail” of the distribution) which
gives the $n_1$ of the formula. In the suggested case, it might be
represented by the number of individuals, out of the entire
group (N) measured, who successfully pass certain somewhat
difficult tests of visual imagery.

In cases where the x-variate (continuous and quantitative
in our example above) can only be divided into classes, showing
no definite order or quantitative relations to one another, the
y-variate being again quantitative and assumed to follow a
Gaussian distribution, but alternative, a modification of the
above method gives η. Neither r nor the contingency method
is applicable in such cases.
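The two-row method may be tried upon artificial material in which the true correlation is known. In the following sketch the normal ordinate and deviate, which the text takes from Sheppard’s Tables, are obtained instead from the `NormalDist` class of the Python standard library; the function name `two_row_r`, the seed, the dividing point of the alternative variate, and the assumed true correlation of .6 are all arbitrary choices made for illustration.

```python
import math
import random
from statistics import NormalDist

def two_row_r(x_all, x_upper):
    """r from a two-row table: all x's, and the x's of the upper (tail) group only."""
    n, n1 = len(x_all), len(x_upper)
    p = sum(x_all) / n                      # mean of all the x's
    p_dash = sum(x_upper) / n1              # mean of the x's in the tail group
    sd_x = math.sqrt(sum((x - p) ** 2 for x in x_all) / n)
    x0 = NormalDist().inv_cdf(1 - n1 / n)   # deviate cutting off the proportion n1/N
    z = NormalDist().pdf(x0)                # ordinate of the normal curve at x0
    q_over_sd = (n / n1) * z                # distance of the centroid of the tail, in units of sigma_y
    return ((p_dash - p) / sd_x) / q_over_sd

# Artificial material: a normal surface with correlation .6, the y-variate being
# recorded only as "above" or "below" an arbitrary dividing point.
rng = random.Random(14)
rho = 0.6
x_all, x_upper = [], []
for _ in range(20000):
    x = rng.gauss(0, 1)
    y = rho * x + math.sqrt(1 - rho ** 2) * rng.gauss(0, 1)
    x_all.append(x)
    if y > 0.5:
        x_upper.append(x)

r_est = two_row_r(x_all, x_upper)
print(round(r_est, 3))
```

The value recovered should lie close to the correlation with which the material was constructed, although only the division of y into two groups has been used.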


4. Short Methods*.

(i) It can be easily shown that
$$\sigma^2_{x-y} = \sigma_x^2 + \sigma_y^2 - 2r_{xy}\,\sigma_x\sigma_y,$$
and therefore that
$$r_{xy} = \frac{\sigma_x^2 + \sigma_y^2 - \sigma^2_{x-y}}{2\sigma_x\sigma_y}.$$


(ii) If the distributions are both normal, and if both
variates have the same mean, m, and the same standard
deviation, σ, then
$$r = 1 - \frac{\pi\,\{S(\overline{m + x} - \overline{m + y})\}^2}{N^2\sigma^2},$$
where $S(\overline{m + x} - \overline{m + y})$ is the sum of the positive differences
only. In Professor Pearson’s words, the rule is:


“Subtract from unity π times the square of the mean sum
of the positive differences of paired variates divided by their
common standard deviation.”
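Both short methods are easily put to trial. In the sketch below the function names and the artificial material (pairs drawn from a normal surface with correlation .5, common mean 0 and common standard deviation 1) are assumptions made for illustration. Method (i) is an algebraic identity and reproduces the product-moment value exactly; method (ii) rests on the normal law and agrees only within the errors of sampling.

```python
import math
import random

def variance(values):
    """Mean square deviation from the mean."""
    m = sum(values) / len(values)
    return sum((v - m) ** 2 for v in values) / len(values)

def r_from_difference(xs, ys):
    """Method (i): r from the standard deviations of x, of y, and of x - y."""
    diffs = [x - y for x, y in zip(xs, ys)]
    return (variance(xs) + variance(ys) - variance(diffs)) / (
        2 * math.sqrt(variance(xs) * variance(ys)))

def r_from_positive_differences(xs, ys, sd):
    """Method (ii): 1 - pi * (mean sum of positive differences / common S.D.)^2."""
    n = len(xs)
    s_pos = sum(x - y for x, y in zip(xs, ys) if x > y)
    return 1 - math.pi * (s_pos / n) ** 2 / sd ** 2

def product_moment_r(xs, ys):
    """Ordinary product-moment coefficient, for comparison."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / n
    return sxy / math.sqrt(variance(xs) * variance(ys))

rng = random.Random(7)
rho = 0.5
xs, ys = [], []
for _ in range(40000):
    u = rng.gauss(0, 1)
    xs.append(u)
    ys.append(rho * u + math.sqrt(1 - rho ** 2) * rng.gauss(0, 1))

r_direct = product_moment_r(xs, ys)
r_i = r_from_difference(xs, ys)
r_ii = r_from_positive_differences(xs, ys, sd=1.0)
print(round(r_direct, 3), round(r_i, 3), round(r_ii, 3))
```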


This method might sometimes be conveniently used in 
determining the “individual” correlation (see below p. 86) 


* Karl Pearson: “On Further Methods of Determining Correlation,”
Drapers’ Company Research Memoirs, Biometric Series IV. 1907.
between performances of the same individuals in the same
mental tests on different occasions.


(iii) The Method of Ranks.


It has recently been suggested by Dr C. Spearman* that
measurements of psychical performance may conveniently (nay,
preferably) be replaced by the numbers representing the rank
or order of merit of the individuals in the group. On this basis
the ordinary product-moment formula for r, $\dfrac{S(xy)}{N\sigma_x\sigma_y}$, can be easily
shown to reduce to the form
$$1 - \frac{6\,S(v_1 - v_2)^2}{N(N^2 - 1)} \qquad (\alpha),$$
where $v_1$ and $v_2$ are the ranks of an individual in the two series.


Dr Spearman has suggested a still simpler formula, which
he calls a “foot-rule” formula. It is
$$R = 1 - \frac{S(g)}{\frac{1}{6}(N^2 - 1)} \qquad (\beta),$$
where S(g) denotes the sum of the “gains” in rank (sum of
positive differences) of the second series on the first, and
$\frac{1}{6}(N^2 - 1)$ is the value of the sum of such gains which may be
expected by chance.


In order to make the determination of correlation by R
comparable with that obtained by the ordinary formula
(presumably, in Dr Spearman’s view, formula (α)) he has
suggested the formula
$$r = \sin\left(\frac{\pi}{2}R\right) \qquad (\gamma).$$


This method of using ranks, and the formulae suggested
therefor, were vigorously criticised by Karl Pearson in the
paper quoted on the preceding page.
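The two rank formulae are readily computed, and a small trial brings out a peculiarity of the “foot-rule” which bears upon the criticisms that follow. The sketch below is illustrative only: the function names are not Spearman’s, the ranks are supposed free from ties, and the five-member series is hypothetical.

```python
def rank_coefficient(v1, v2):
    """Expression (alpha): 1 - 6 S(v1 - v2)^2 / (N (N^2 - 1)), for untied ranks."""
    n = len(v1)
    return 1 - 6 * sum((a - b) ** 2 for a, b in zip(v1, v2)) / (n * (n * n - 1))

def foot_rule(v1, v2):
    """Spearman's foot-rule R = 1 - 6 S(g) / (N^2 - 1), S(g) being the sum of positive differences."""
    n = len(v1)
    gains = sum(b - a for a, b in zip(v1, v2) if b > a)
    return 1 - 6 * gains / (n * n - 1)

first = [1, 2, 3, 4, 5]            # hypothetical ranks in the first series
same = [1, 2, 3, 4, 5]             # second series in identical order
reversed_order = [5, 4, 3, 2, 1]   # second series in exactly inverse order

print(rank_coefficient(first, same), foot_rule(first, same))
print(rank_coefficient(first, reversed_order), foot_rule(first, reversed_order))
```

For identical orders both formulae give +1; for exactly inverse orders expression (α) gives −1, while R falls only to −.5 in this instance.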

* C. Spearman: “Measurement of Association between Two Things,” Am. J. P.
Vol. XV. 1904; “‘Foot-rule’ for Measuring Correlation,” Brit. Journ. of
Psychology, Vol. II. Pt. 1, July 1906.


Pearson’s Criticisms.


(a) The direct use of ranks, or order in a series, as a 
quantitative measure of character, is inadmissible. It assumes 
that the unit of rank is equal throughout the scale. But this 
is very obviously not the case. “Between mediocrities the unit 
of rank treated as a measure of a variate is practically zero, 
between extreme individuals it is very large indeed.” Some 
form of frequency distribution must be assumed, and the method 
of ranks assumes that form to be a rectangle. The assumption 
made could hardly be a more improbable one. On the 
assumption of “normal” or Gaussian distribution, Pearson
shows that the correlation of grades (not quite the same thing
as ranks) is ρ, where
$$\rho = 1 - \frac{6\,S(v_1 - v_2)^2}{N(N^2 - 1)},$$
the expression (α) above.

The grade of an individual in a group is measured by the
number of individuals above him, represented by the shaded
portion of the following normal curve, thus:


[Fig. 15.]


Hence the quantities to be correlated are
$$x\text{-grade,}\quad g_1 = \tfrac{1}{2}N + \frac{N}{\sqrt{2\pi}\,\sigma_1}\int_0^x e^{-\frac{x^2}{2\sigma_1^2}}\,dx,$$
$$\text{and } y\text{-grade,}\quad g_2 = \tfrac{1}{2}N + \frac{N}{\sqrt{2\pi}\,\sigma_2}\int_0^y e^{-\frac{y^2}{2\sigma_2^2}}\,dy,$$
giving
$$r = 2\sin\left(\frac{\pi}{6}\rho\right) = 2\sin\left\{\frac{\pi}{6}\left[1 - \frac{6\,S(v_1 - v_2)^2}{N(N^2 - 1)}\right]\right\} \qquad (\delta).$$
In terms of the sum of positive differences of ranks (“gains”
in rank) the formula is
$$r = 2\cos\left\{\frac{2\pi\,S(g)}{N^2 - 1}\right\} - 1 = 2\cos\left\{\frac{\pi}{3}(1 - R)\right\} - 1 \qquad (\epsilon).$$


This, then, is the formula which should take the place of
Spearman’s formula (γ).
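The difference between the two conversions of R may be seen by evaluating each at the ends of the range of R. The sketch assumes Pearson’s formula in the form $r = 2\cos\{\frac{\pi}{3}(1 - R)\} - 1$ and Spearman’s in the form $r = \sin(\frac{\pi}{2}R)$; the function names are illustrative.

```python
import math

def pearson_r_from_R(R):
    """Pearson's conversion of the foot-rule R: r = 2 cos(pi (1 - R) / 3) - 1."""
    return 2 * math.cos(math.pi * (1 - R) / 3) - 1

def spearman_r_from_R(R):
    """Spearman's conversion of the foot-rule R: r = sin(pi R / 2)."""
    return math.sin(math.pi * R / 2)

for R in (1.0, 0.5, 0.0, -0.5):
    print(R, round(pearson_r_from_R(R), 3), round(spearman_r_from_R(R), 3))
```

Pearson’s form carries the range of R, from +1 to −.5, on to the full range of r, from +1 to −1, with R = 0 corresponding to r = 0; the other form stops short at about −.71 when R = −.5.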


(b) With small values of N, R retains the same value for
wide variations in ρ [expression (α) above].

(c) Spearman obtains the value $\dfrac{.43}{\sqrt{n}}$ for the probable error
of R for zero correlation, empirically, by noting the distribution
of a large number of chance values of S(g)*. In the case of
some of these, the corresponding values of R are negative.
Now the negative values of R only range from 0 to −.5. Hence
the value which he deduces for the probable error, being that of
a quantity ranging from +1 to −.5, is not comparable with the
probable error for the product-moment formula (range, +1
to −1). The comparative accuracy of the different methods may
be deduced from Pearson’s formulae by using differentials, and
Pearson has proved that the probable error of ρ for zero
correlation $= \dfrac{.67449}{\sqrt{n - 1}}$. He is thus able to show that, for zero
correlation,

P.E. of r deduced from ρ $= \dfrac{.7063}{\sqrt{n - 1}}$,

and P.E. of r deduced from R $= \dfrac{[\ldots]}{\sqrt{n - 1}}$.

Thus the rank methods are less accurate than the product-moment
method. “In particular it requires about 30% more
observations by the R method to obtain r with the same degree
of certainty when r = 0.”

(d) When R works out negative, Spearman recommends
that one of the series be reversed. But since R is not a


* Brit. Journ. of Psychology, Vol. II. Pt. 1. 1906, pp. 107, 108.
symmetrical function of ρ, this is not a sound course to adopt.
In fact, if the numbers are small, reversal of one of the series
may yet leave R negative. Pearson’s own formula (ε) is free
from this drawback*.


An important argument brought forward by Spearman† is
one to the effect that absolute measurements are often hetero- 
geneous and that ranking removes this heterogeneity. There is 
an assumption underlying this argument which would seem to 
be of somewhat doubtful validity. All, or almost all, so-called 
mental measurement is measurement of the object. It is 
physical, not mental at all. The physical objects, the physical 
results can be measured—in terms of space and time—and they 
can be correlated. Psychical measurement has perhaps some 
plausibility in the case of sensation intensities, in the form of 
the “sense-distances” of Delboeuf; but this furnishes small 
justification for the claims made above on behalf of ranks. The 
unit of rank means nothing until the form of distribution is 
fixed, and this can hardly be determined on a purely psychical 
basis; at least it is difficult at present to conceive of even an
attempt at such a determination. 


Another important passage occurs on p. 99 of the previously-quoted
article. “In this simplified form the standard or r method
[Spearman apparently means the formula $1 - \dfrac{6\,S(v_1 - v_2)^2}{N(N^2 - 1)}$,
which is ρ, not the standard or r method; $r = 2\sin\left(\dfrac{\pi}{6}\rho\right)$]
shows itself to be solely distinguished from our short or R
method by the fact that the differences of rank are squared.
The effect of squaring is to give more ‘weight’ to the extreme
differences as compared with the medium ones. This is
probably a considerable advantage in most physical measurements.
But in other fields of research, and perhaps above all
in psychology, these extreme cases are just the ones of most

* Spearman endeavours to meet some of these criticisms in a recent paper;
British Journal of Psychology, Vol. III. Pt. 3, Oct. 1910.

† C. Spearman: “‘Foot-rule’ for Measuring Correlation,” British Journal
of Psychology, Vol. II. Pt. 1. July 1906, p. 93.
suspicious validity [why?] so that squaring is here more likely
to do harm than good.” 


Such an argument is sound as applied to actual measure- 
ments, but as soon as those measurements have been translated 
into ranks, it loses its force. The extreme ranks are more 
reliable than the medium ones since they are farther apart from 
one another, and therefore the extreme differences are more— 
not less—reliable than the medium ones. 


Finally, Spearman says that he deduced his formula
$$r = \sin\left(\frac{\pi}{2}R\right)$$
empirically from 111 correlations, the average number of cases
in each being 21. Now, for n = 21 we have seen that the P.E.
is even larger than that given by the usual formula,
$.67449\,\dfrac{1 - r^2}{\sqrt{n}}$. Hence, the chance that the selected formula
was the best one cannot have been great. Doubtless many
others would have fitted equally well; and if Spearman wishes 
to support his formula as against Pearson’s on the strength of 
the fact that it was deduced empirically from actual cases of 
psychical correlation and therefore was likely to be best suited 
to the forms of frequency distribution commonly followed by 
psychical measurements, such an argument would be of little 
cogency. 
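
To give a sense of scale, the usual probable error of a product-moment coefficient, 0.67449(1 − r²)/√n, can be evaluated for samples of 21 cases. This is a sketch only; the value r = 0.5 is a hypothetical illustration, not a figure from Spearman's data.

```python
import math


def probable_error_r(r, n):
    """Usual probable error of a product-moment r: 0.67449 (1 - r^2) / sqrt(n)."""
    return 0.67449 * (1 - r * r) / math.sqrt(n)


# Hypothetical coefficient of 0.5 observed on 21 cases.
pe = probable_error_r(0.5, 21)
print(round(pe, 3))
```

With so few cases the probable error is of the order of a tenth, so formulas differing only slightly in the values they yield could hardly be told apart empirically.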

In all work where accuracy, and, above all, thoroughness are 
required, the method of ranks should be avoided since it gives 
neither the mean nor the standard deviation, quantities important
in themselves and valuable as a means of obtaining the
coefficient of variation of the group in the particular mental 
functions under consideration. Ranks hide variations of great 
psychological importance, and introduce a spurious homogeneity 
which can well be dispensed with. When the number of cases 
is large, ranking takes too long a time, and the grouping of the 
individuals in a correlation table gives much more information 
as to the nature of the correlation. Even when the numbers 
worked with are comparatively small (say 30—100), means, 
standard deviations, and coefficients of variation are results of
considerable intrinsic value, and in finding the standard deviation
the deviations from the means have to be written down, so
that everything needful for the calculation of r is already 
determined, and the evaluation of correlation coefficients 
between the various series of measurements taken in pairs is an 
operation quickly performed. Another reason for drawing up 
a correlation table whenever possible—and this of course is 
impossible with ranks—is that only thereby can the correlation 
ratio, η, be calculated, a result which tests the linearity or
non-linearity of the regression, and is moreover a better measure of
causal relationship than r itself.


Only when n is small (< 30) are ranks to be recommended
in preference to actual measurements (since means, S.D.’s, etc.
are in these cases of little value), and then ρ is as easily
calculated as R, and more reliable. The results with such
small numbers could do little more than indicate the existence
of correlation in certain cases. They could give practically no
information as to its size owing to the large amount of probable
error.


VI. Spurious Correlation. 


Correlation is said to be “spurious” when it is due to ex- 
traneous conditions and does not arise directly out of the two 
functions under consideration. The term is one of relative 
rather than absolute significance, but its appropriateness will 
become apparent after a consideration of the following two 
examples : 


1. Spurious correlation arising from heterogeneity of 
material. 


Let us suppose that two distinct groups of children have 
been measured for two characters A and B, and that the mean 
abilities, in both A and B, are higher in the one group than in 
the other. Then, even if there is no correlation between the 
two characters, as estimated from each group, separately, a 
positive correlation will be obtained from the two groups taken 
together. On the other hand, if the mean ability in A is higher 
and the mean ability in B lower, in the one group than in the 
other, a negative correlation will be obtained by taking the two
groups together. The correlation in each case will be due
simply to the heterogeneity of the material employed. The 
difference in the mean values for the two groups must of course 
have some cause, such as a difference of nationality, sex, or 
even locality (within any one town or district) from which the 
children are drawn; or, again, such a cause as the difference of 
discipline to which the two groups have been subjected in the 
past. These are extraneous conditions and, if measurable, can 
be allowed for by employing the method of partial correlation. 
As a rule, however, they are not easy to determine quantitatively;
hence their dangerous character. 
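
The effect of heterogeneity described above can be imitated by a small simulation. This is a sketch under stated assumptions: the two groups, their sizes, their means and the normal distributions are all hypothetical, chosen only so that A and B are independent within each group while both means are higher in the second group.

```python
import random


def pearson_r(xs, ys):
    """Product-moment correlation of two equal-length series."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


def make_group(rng, size, mean_a, mean_b):
    """Characters A and B drawn independently, so r is zero within the group."""
    a = [rng.gauss(mean_a, 1.0) for _ in range(size)]
    b = [rng.gauss(mean_b, 1.0) for _ in range(size)]
    return a, b


rng = random.Random(1911)
a1, b1 = make_group(rng, 2000, 0.0, 0.0)   # first group
a2, b2 = make_group(rng, 2000, 2.0, 2.0)   # second group, higher in both A and B

r_group1 = pearson_r(a1, b1)
r_group2 = pearson_r(a2, b2)
r_pooled = pearson_r(a1 + a2, b1 + b2)
print(round(r_group1, 3), round(r_group2, 3), round(r_pooled, 3))
```

Within each group the coefficient stays near zero, while the pooled coefficient is decidedly positive; reversing the sign of one of the mean differences would make it negative.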


2. Index Correlation*. 


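This is a form of spurious correlation which arises from the
use of ratios for measurements. Thus if

$$z_1 = \frac{x_1}{x_3}, \qquad z_2 = \frac{x_2}{x_3},$$

and $x_1$, $x_2$, $x_3$ are uncorrelated with one another, it can be shown
that the correlation between $z_1$ and $z_2$

$$= \frac{v_3^2}{\sqrt{(v_1^2 + v_3^2)(v_2^2 + v_3^2)}}, \qquad v_i = \frac{\sigma_{x_i}}{\bar{x}_i},$$

a quantity which may be as large as .5.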


As an illustration from psychology, we may mention the 
correlation of errors of observation, either of different individuals 
or of the same individual at different times. Even if the 
absolute errors are entirely uncorrelated, the percentage errors 
(errors expressed as percentages or other fractions of the 
observed quantity) will often show a large amount of cor- 
relation. In such a case as this, the important question arises 
whether we are, from other considerations, justified in using 
ratios (or indices) rather than absolute measures. If we are, 
the index correlation is not to be regarded as spurious†.
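
Index correlation is easily exhibited numerically. The sketch below assumes the set-up of Pearson's paper, namely two ratios with a common denominator, and three independent quantities with equal coefficients of variation; the means, standard deviations and sample size are hypothetical.

```python
import random


def pearson_r(xs, ys):
    """Product-moment correlation of two equal-length series."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


rng = random.Random(1897)
n = 5000
# Three independent quantities, each with coefficient of variation 5/100.
x1 = [rng.gauss(100, 5) for _ in range(n)]
x2 = [rng.gauss(100, 5) for _ in range(n)]
x3 = [rng.gauss(100, 5) for _ in range(n)]

# Ratios (indices) sharing the denominator x3.
z1 = [a / c for a, c in zip(x1, x3)]
z2 = [b / c for b, c in zip(x2, x3)]

r_raw = pearson_r(x1, x2)
r_index = pearson_r(z1, z2)
print(round(r_raw, 3), round(r_index, 3))
```

The absolute measures are uncorrelated, yet the two indices correlate to about one half, the value expected when the three coefficients of variation are equal.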


* Karl Pearson: “On a form of Spurious Correlation which may arise when
Indices are used in the Measurement of Organs,” Proc. Roy. Soc. Vol. LX. 1897,
p. 489.


† Spurious correlation of this (as of any) kind can be eliminated by partial
correlation.


VII. The Significance of the Correlation Coefficient.


A correlation between two series of measurable characters
may be regarded quite generally as due to some sort and
degree of community or identity of causation, and the coefficient
of correlation, r, is some sort of measure of this identity of
causation. We may, then, quite legitimately inquire what
numerical relation r bears to the identity. To take an example
from Psychology: when we find that the (partial) correlation
between performance in Algebra and performance in Arithmetic
is .75, does this mean that 75% of combined algebraical ability
and arithmetical ability is common to the two, the mental
operations being practically identical in the two subjects for
this portion, or does the .75 measure the degree of identity in
some more complicated way?


The general solution of the problem appears to be impos- 
sible. Its meaning may, however, become clearer after a 
consideration of a particular case in which the significance of 
the correlation coefficient is apparent. The case referred to is
an experiment in dice-throwing devised by the late Professor
Weldon*, and called after him Weldon’s experiment†. Let a
dozen dice be shaken up in a box and thrown again and again,
the number of dice showing four or more spots on the uppermost
face in each throw being recorded. Then if the results
of the 1st, 3rd, 5th, etc. throws, and those of the 2nd, 4th, 6th,
etc., be written down in the form of two series, the 2nd
opposite the 1st, the 4th opposite the 3rd, etc., these two
series will obviously show no correlation, since the result of the
2nd throw is in these circumstances quite independent of the
result of the 1st, and likewise with the 4th and 3rd, the 6th
and 5th, etc. Let now the second of each of these pairs of throws
be made dependent on the first of each pair, in the following way. 
Let a certain proportion of the dice, say 6, be stained red, and 
then, after throwing all the dice and counting indiscriminately 


* W.F.R. Weldon: ‘Inheritance in Animals and Plants” in Lectures on 
the Method of Science, edited by T. B. Strong, Oxford, Clarendon Press, 1906, 
pp. 100-106. 

† A. D. Darbishire: “Some Tables for illustrating Statistical Correlation,”
Memoirs and Proceedings of the Manchester Literary and Philosophical Society,
Vol. LI. Pt. III. 1906-1907.

all those showing 4 or more spots on the uppermost face, for
the second throw gather up only the white dice, leaving the 
red upon the table, and throw, counting however both white 
and red dice when computing the score. The number due to 
the red dice will thus be common to the two scores. Successive 
pairs of throws are then made in the same way. If now the 
two series of values be compared, they will be found to be 
correlated together; a result to be expected, since the score of 
the second throw of each pair was to a certain extent dependent 
on that of the first. The actual value of the correlation
coefficient will be found to be, in the case taken, 6/12, or .5.
Quite generally, if n is the total number of dice thrown, and l
the number left on the table after the first of each pair of
throws,

$$r = \frac{l}{n}.$$
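
Weldon's experiment lends itself to imitation by a short simulation. The sketch below is an illustration only: the function names, the seed and the number of pairs of throws are hypothetical choices, and a die "scores" when it shows four or more spots, as in the text.

```python
import random


def pearson_r(xs, ys):
    """Product-moment correlation of two equal-length series."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


def score(rng, count):
    """Number of dice, out of `count`, showing four or more spots."""
    return sum(1 for _ in range(count) if rng.randint(1, 6) >= 4)


def weldon_pairs(rng, n_dice, n_left, n_pairs):
    """Scores of paired throws in which `n_left` red dice stay on the table."""
    firsts, seconds = [], []
    for _ in range(n_pairs):
        red = score(rng, n_left)  # thrown once, counted in both scores
        firsts.append(red + score(rng, n_dice - n_left))
        seconds.append(red + score(rng, n_dice - n_left))
    return firsts, seconds


rng = random.Random(6)
r_half = pearson_r(*weldon_pairs(rng, 12, 6, 10000))     # 6 of 12 dice left
r_quarter = pearson_r(*weldon_pairs(rng, 12, 3, 10000))  # 3 of 12 dice left
print(round(r_half, 3), round(r_quarter, 3))
```

The simulated coefficients fall close to 6/12 and 3/12 respectively, that is, close to the proportion of dice common to the two throws.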


This result admits of a theoretical proof as follows: 


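Let the successive scores of the pairs of throws be

$$\begin{array}{ll}
a_1 + c_1, & c_1 + b_1 \\
a_2 + c_2, & c_2 + b_2 \\
a_3 + c_3, & c_3 + b_3 \\
\quad\vdots & \quad\vdots \\
a_N + c_N, & c_N + b_N
\end{array}$$

and let $x = a + c$, $y = b + c$, all quantities being measured as
deviations from their means.

Then

$$r_{xy} = \frac{S(xy)}{N\sigma_x\sigma_y}, \qquad S(xy) = S\{(a + c)(b + c)\} = S(c^2),$$

since a, b and c are uncorrelated with one another, so that
$S(ab) = S(ac) = S(bc) = 0$. Also

$$\sigma_x^2 = \frac{S(a^2)}{N} + \frac{S(c^2)}{N} = \sigma_a^2 + \sigma_c^2, \qquad \sigma_y^2 = \sigma_b^2 + \sigma_c^2, \qquad \sigma_a = \sigma_b,$$

$$\therefore\; r_{xy} = \frac{\sigma_c^2}{\sigma_a^2 + \sigma_c^2}.$$

Let l = no. of dice left on table,
    m = no. of dice thrown twice;
    l + m = n.

Then, since the distribution of results in the throwing of n
dice N times is given by the successive terms of the binomial
expansion of

$$N(p + q)^n,$$

of which the S.D. $= \sqrt{npq}$ (putting N equal to unity),

$$\sigma_c^2 = lpq, \qquad \sigma_a^2 = mpq,$$

and

$$r_{xy} = \frac{l}{l + m} = \frac{l}{n}.$$
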
In the case of Weldon’s experiment, then, the proportion of 
factors common to the two series is given by the correlation 
coefficient itself, but in the cases ordinarily met with in biology 
and psychology the relations of the factors at work must be 
much more complex. The extent of identity of causation is 
measured by φ(r), some function of r, but the form of this
function is unknown to us. It is, for example, an unwarranted
assumption to say, as Spearman does, that the square of the
correlation coefficient measures “the relative influence of the
factors in A tending towards any observed correspondence
(with another variable B) as compared with the remaining
components of A tending in other directions*.” 


* C. Spearman: “The Proof and Measurement of Association between Two
Things,” American Journal of Psychology, Vol. XV. Jan. 1904, p. 75.


CHAPTER II 


HISTORICAL 


The history of the use of the theory of correlation in 
Psychology can hardly be said to have begun earlier than the 
commencement of the present century. During the previous 
twenty years, indeed, a great deal of work had been done by 
many observers in measuring simple mental abilities (by the 
“mental test” method) in larger or smaller groups of subjects, 
and attempts had even been made to determine in what way 
these abilities were related to one another and to more general 
mental ability, or “ general intelligence.” 


Owing, however, to a universal lack of knowledge of the 
mathematical theory of correlation among psychologists during 
this period, the results were not obtained in a form suitable for 
comparison with one another, so that it is not surprising to 
find that they hopelessly contradict one another. The hetero- 
geneity of the material worked with, the non-elimination of 
irrelevant factors, and the absence of any measure of the 
“probable error” of the results make the conclusions drawn by 
the investigators themselves from their researches utterly
unreliable. 

The first investigation showing any mathematical precision 
was that published by Clark Wissler* in 1901. It contained 
(inter alia) an account of the careful application of a large 
number of simple mental tests upon over 200 college students, 
and a correlation of the results with one another and with the 
students’ marks in the various subjects of the college curri- 
culum. The mental tests were found to correlate but slightly 
with one another or with ability in college subjects of study, 


* Clark Wissler: “The Correlation of Mental and Physical Tests,”
Psychological Review, Monograph Supplement, Vol. III. No. 16, June 1901.

though these latter showed considerable correlation with one
another (.30–.75).

In the following year Aikins and Thorndike* published 
results which were in a sense confirmatory of those of Wissler, 
since, notwithstanding the greater similarity to one another 
of the functions investigated than in Wissler’s research, the 
correlations were again found to be low. For example, different 
tests devised for the measurement of “speed of association ” 
were found to show hardly any correlation—a result which 
seems to furnish some justification for the authors’ statement 
that “quickness of association as an ability determining the 
speed of all one’s associations is a myth” (op. cit. p. 375). 
A similar lack of close relationship was found in the case of 
other mental functions which would, on the evidence of intro- 
spection alone, be confidently classed as particular instances of 
the same general mental function. 


In C. Spearman’s† researches, published in 1904 and 1906,
a change in tactics appears. Instead of measuring large
numbers of individuals, as his predecessors had done, Spearman
contents himself with small numbers, groups of less than 40 in
his first research, and as few as 11 (in the case of one of the
tests, 7) in the second; but he proposes to make up for the
unreliability thus introduced into the results by a more careful 
measurement of his cases, and the application of “corrections ” 
to the “raw” values of his correlation coefficients by means of 
appropriate mathematical formulae. 


The “corrections” are of two kinds. One is that for the 
“elimination of irrelevant factors” and is nothing new in the 
theory of correlation, being simply the method of “ partial” 
correlation (see p. 63 above). 


* “Correlations in the Perceptive and Associative Processes,” Psychological
Review, Vol. IX.

† C. Spearman: “‘General Intelligence’ Objectively Determined and
Measured,” American Journal of Psychology, Vol. XV. 1904, pp. 201-292.
F. Krueger and C. Spearman: “Die Korrelation zwischen verschiedenen
geistigen Leistungsfähigkeiten,” Zeitschrift für Psychologie, Bd. XLIV. 1906.


For example: To eliminate effects of difference of age in
the group experimented on one would determine the “partial”
correlation between the two characters under consideration, for
“age constant.” Thus, in Yule’s notation, if 1 and 2 represent
the characters to be compared and 3 represents “age,”

$$r_{12.3} = \frac{r_{12} - r_{13}\,r_{23}}{\sqrt{(1 - r_{13}^2)(1 - r_{23}^2)}}.$$

A similar procedure is needed for eliminating the effects of
differences of sex, etc. A preferable course, however, would be
to dispense as far as possible with the necessity for such
correcting by selecting groups of individuals of the same age,
sex, etc.
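
The partial correlation for "age constant" is a matter of simple arithmetic once the three raw coefficients are known. The sketch below uses hypothetical coefficients; only the formula itself is taken from the text.

```python
import math


def partial_r(r12, r13, r23):
    """Correlation of characters 1 and 2 with character 3 (e.g. age) held constant."""
    return (r12 - r13 * r23) / math.sqrt((1 - r13 ** 2) * (1 - r23 ** 2))


# Hypothetical raw coefficients: two tests correlate 0.60 with each other,
# and 0.50 and 0.40 respectively with age.
r_age_constant = partial_r(0.60, 0.50, 0.40)
print(round(r_age_constant, 3))
```

If the raw coefficient is no larger than the product of the two correlations with age, the partial coefficient falls to zero, which is the case of a correlation wholly due to the irrelevant factor.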


The other method of correction advocated is that for the 
“elimination of observational errors.” Obviously, errors of 
observation must make any correlation, worked from measure- 
ments containing them, different from (generally, though not 
universally, less than) the true value of the correlation. The 
formula, in its full form, which Spearman has proposed as a 
means of correcting for this, and of which he has given a proof 
in the Am. J. P.*, is as follows:

$$r_{xy} = \frac{\sqrt[4]{r_{x_1y_1}\, r_{x_1y_2}\, r_{x_2y_1}\, r_{x_2y_2}}}{\sqrt{r_{x_1x_2}\, r_{y_1y_2}}},$$

where x, y are the true values to be correlated, and $x_1$, $x_2$, $y_1$, $y_2$
are two pairs of obtained values.

$r_{x_1x_2}$, and similar coefficients, are called by him “reliability
coefficients.” They represent the correlation of two distinct
series of measurements of the same mental capacity.
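
Spearman's full correction formula, as described above, divides the geometric mean of the four cross-correlations by the geometric mean of the two reliability coefficients. The sketch below is illustrative only: the function name and all the coefficients are hypothetical, and the check for a non-positive product is an added safeguard, since the fourth root is then undefined.

```python
import math


def corrected_r(r_x1y1, r_x1y2, r_x2y1, r_x2y2, r_x1x2, r_y1y2):
    """Spearman's correction: geometric mean of cross-correlations
    divided by the geometric mean of the two reliability coefficients."""
    product = r_x1y1 * r_x1y2 * r_x2y1 * r_x2y2
    if product <= 0 or r_x1x2 * r_y1y2 <= 0:
        raise ValueError("formula needs positive cross-correlations and reliabilities")
    return product ** 0.25 / math.sqrt(r_x1x2 * r_y1y2)


# Hypothetical values: every cross-correlation 0.30,
# reliability coefficients 0.60 and 0.50.
r_true = corrected_r(0.30, 0.30, 0.30, 0.30, 0.60, 0.50)
print(round(r_true, 3))
```

Low reliability coefficients inflate the corrected value considerably, so the result is only as trustworthy as the assumptions on which the formula rests.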


The proof of Spearman’s formula is only valid on the
assumption that the errors of measurement are uncorrelated
with each other or with x or y†. Thus, if x, y, δ, ε represent
deviations from means, and

$$x_1 = x + \delta_1, \qquad x_2 = x + \delta_2,$$

$$y_1 = y + \epsilon_1, \qquad y_2 = y + \epsilon_2,$$

then $S(x\delta)$, etc. $= 0$, $S(\delta_1\delta_2)$, $S(\epsilon_1\epsilon_2)$, etc. $= 0$;
also $S(x_1y_1) = S(x_2y_2) = S(xy)$.
[All this is involved in Mr Yule’s proof.]


* C. Spearman: “Demonstration of Formulae for True Measurement of
Correlation,” American Journal of Psychology, Vol. XVII. 1906.

† See Mr G. Udny Yule’s short proof of the formula, quoted in my pamphlet
on “Some Experimental Results in Correlation,” Comptes Rendus du VI
Congrès International de Psychologie, Genève, Août 1909.


Now, these are very large assumptions to make. Even in
cases where the quantities δ, ε are genuine errors of measurement,
there are strong reasons for assuming (on general
principles and also from experimental evidence)* that they will
be correlated. But in the case of almost all the simpler mental
tests the quantities δ and ε are not errors of measurement at
all. They are the deviations of the particular performances
from the hypothetical average performance of the several
individuals under consideration. Thus they represent the
variability of performance of function within the individual.
When an individual in the course of three minutes succeeds in
striking through 100 e’s and r’s in a page of print on one day,
and 94 under the same conditions a fortnight later, there is no
error of observation involved. The numbers 100 and 94 are
the actual true measures of ability on the two occasions. The
average or mean ability, which is the more interesting measure
for the purposes of correlation, is doubtless different from
either, but that does not make the other two measures erroneous.
Evidently in these cases δ and ε represent individual variability,
and to assume them uncorrelated with one another or with the
mean values of the functions is to indulge in somewhat a priori
reasoning.


There are two comparatively simple ways of testing the
assumption:

(1) $S(xy) = S(x_1y_1) = S(x_2y_2)$,
∴ $S(x_1y_1) - S(x_2y_2)$ should = 0 within the limits of the
probable error of the difference.


I have applied this test to the case of correlation between 
accuracy in bisecting lines and accuracy in trisecting them in 
43 adult subjects. 


* See Karl Pearson: “On the Mathematical Theory of Errors of Judgment,
with special reference to the Personal Equation,” Phil. Trans. A, Vol. 198,
pp. 235-299.

Here $S(b_1t_1) - S(b_2t_2) = 137780 - 60036 = 77744$,


p.B. of S (xy) =°67449 af fois 
in Pearson’s notation. 
_ 67449 /S(ayP  S(#) S(y’) 
ee n n2 : 


P.E. of $S(b_1t_1) = 687$, P.E. of $S(b_2t_2) = 365$.

∴ P.E. of $S(b_1t_1) - S(b_2t_2) = \sqrt{687^2 + 365^2} = 778$.

Since 778 is less than one-third of 77744, the formula 
cannot be employed to obtain the correlation between mean 
abilities in bisecting and trisecting lines. 
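
The arithmetic of test (1) in this case can be checked directly from the figures quoted above; nothing beyond those figures is assumed, and the variable names are merely illustrative.

```python
import math

# Product sums quoted in the text for the two occasions.
s_b1t1 = 137780
s_b2t2 = 60036
difference = s_b1t1 - s_b2t2

# Probable errors quoted in the text, combined as for independent quantities.
pe_difference = math.sqrt(687 ** 2 + 365 ** 2)

ratio = difference / pe_difference
print(difference, round(pe_difference), round(ratio))
```

The difference is roughly a hundred times its probable error, so it cannot be regarded as zero within the limits of that error.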

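(2)

$$r_{x_1-x_2,\;y_1-y_2} = \frac{S\{(x_1 - x_2)(y_1 - y_2)\}}{\sqrt{S(x_1 - x_2)^2 \cdot S(y_1 - y_2)^2}} = \frac{S\{(\delta_1 - \delta_2)(\epsilon_1 - \epsilon_2)\}}{\sqrt{S(\delta_1 - \delta_2)^2 \cdot S(\epsilon_1 - \epsilon_2)^2}} = 0$$

if errors are uncorrelated with one another (since numerator
then = 0).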


Applying this test to the same case of bisection and trisection,
I get

$$r_{b_1-b_2,\;t_1-t_2} = 0.30 \pm 0.09,$$

which proves once more the inapplicability of the formula.

I applied test (2) also to the case of correlation between
speed of addition of figures and accuracy of addition in a group
of 38 school children (girls between the ages of 11 and 12,
Group II below) and found the corresponding correlation between
the differences of the paired measurements to be 0.35 ± 0.09.

Even when test (2) does give the value 0, we can only
conclude from this that

$$S(\delta_1\epsilon_1) + S(\delta_2\epsilon_2) = S(\delta_1\epsilon_2) + S(\delta_2\epsilon_1).$$

We cannot conclude that the formula is applicable, unless
we have further independent evidence*.
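
Test (2) can be illustrated by simulation. The sketch below is built on hypothetical assumptions throughout: normally distributed true values, and an "occasion" disturbance that, when present, affects both tests taken on the same occasion and so makes the deviations δ and ε correlated.

```python
import random


def pearson_r(xs, ys):
    """Product-moment correlation of two equal-length series."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


def difference_correlation(rng, n, shared_sd):
    """Correlation between (x1 - x2) and (y1 - y2) for n simulated subjects."""
    dx, dy = [], []
    for _ in range(n):
        x = rng.gauss(0, 1)               # true ability in the first function
        y = 0.5 * x + rng.gauss(0, 1)     # true ability in the second function
        s1 = rng.gauss(0, shared_sd)      # disturbance common to occasion 1
        s2 = rng.gauss(0, shared_sd)      # disturbance common to occasion 2
        x1, x2 = x + s1 + rng.gauss(0, 1), x + s2 + rng.gauss(0, 1)
        y1, y2 = y + s1 + rng.gauss(0, 1), y + s2 + rng.gauss(0, 1)
        dx.append(x1 - x2)
        dy.append(y1 - y2)
    return pearson_r(dx, dy)


rng = random.Random(2)
r_independent = difference_correlation(rng, 5000, 0.0)  # deviations uncorrelated
r_shared = difference_correlation(rng, 5000, 1.0)       # deviations correlated
print(round(r_independent, 3), round(r_shared, 3))
```

The true values cancel in the differences, so the coefficient reflects the deviations alone: it is near zero when they are independent and clearly positive when they share an occasion effect.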


For the reasons presented above, I should prefer to avoid
the use of Spearman’s formula (by increasing the number of
the original measurements of each ability†) and would also
suggest that his so-called “reliability” coefficients might in
some cases be more appropriately termed “coefficients of individual
correlation,” since they are more analogous to Karl
Pearson’s “correlation of undifferentiated like parts” than to
anything else‡.

The above discussion raises the interesting question as to
the relation between ability and variability, and the correlation
coefficient between mean ability and the standard deviation
would be the best measure of this relation. When the measurements
have been made on only two separate occasions, the
expression $r_{x_1+x_2,\;x_1\sim x_2}$ might be regarded as a rough measure of the
relation, and I would suggest that it be called a “variability
coefficient” (not to be confused with the “coefficient of variation,”
which $= \dfrac{\text{S.D.}}{\text{mean}}$). If $x_1$ and $x_2$ are chance values, and if
the distribution of abilities at the given task within one and
the same individual is approximately normal, then

$x_1 \sim x_2 = 1.13\,\sigma$ (approx.)§,

so that there is sufficient justification for this value.
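
For two independent chance values from a normal distribution of standard deviation σ, the mean absolute difference is 2σ/√π, or about 1.13σ; this is assumed here to be the relation appealed to above. The sketch below checks it by simulation, with a hypothetical mean, standard deviation and sample size.

```python
import math
import random

# Theoretical ratio of the mean absolute difference to sigma.
expected_ratio = 2 / math.sqrt(math.pi)

rng = random.Random(0)
sigma = 3.0
differences = [
    abs(rng.gauss(50, sigma) - rng.gauss(50, sigma)) for _ in range(50000)
]
simulated_ratio = (sum(differences) / len(differences)) / sigma
print(round(expected_ratio, 3), round(simulated_ratio, 3))
```

On this assumption the difference between two performances of the same individual is, on the average, proportional to that individual's standard deviation, which is what allows it to serve as a rough index of variability.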


* In a recent paper (British Journal of Psychology, Vol. III. Oct. 1910,
pp. 271 ff.) Spearman admits the difficulties attending the use of his “correction
formula,” and brings forward another formula to correct for “accidental” as
distinct from “constant” errors. Since no experimental evidence is given as
to the validity of the assumptions which this new formula involves, I merely
content myself with mentioning the fact of its publication here.

† This plan would also have the advantage of keeping the probable error
low. Correction by Spearman’s formula while raising the value of the coefficient
raises the size of the P.E. in the same proportion.

‡ Grammar of Science, 2nd Ed., pp. 393, 397.

§ Karl Pearson: “Francis Galton’s Difference Problem,” Biometrika, Vol. I,
p. 399.


To return to Spearman’s researches. In his first series of
investigations (Am. J. P. Vol. XV.) the following groups of
subjects were measured: 24 village school children of both sexes,
age limits 10.0–13.10; 23 boys of a high class preparatory
school, age limits 9.5–13.7; and 27 adults of both sexes, age
limits 21–78. The tests employed were those for pitch discrimination,
weight discrimination and discrimination of light
intensities; and measures of intelligence were obtained, in the
case of the children, from results of school examinations, grading
by teachers, and grading of one another by the children themselves
(measure of common sense). Only in the cases of
“common sense” (village children) and “final school order”
were reliability coefficients calculated. Consequently the reliability
coefficient for common sense (.64) was used to bring up
the various sensory discrimination coefficients to their true
values. Corrections were made for age and sex, and the final
results were then found to be comparatively high in the village
school group.

In the preparatory school group the highest raw coefficient
was 0.32 ± 0.08, that between weight discrimination and light
discrimination; in the group of adults that between pitch discrimination
and light discrimination was the highest, being likewise
0.32 ± 0.08. The various school subjects in the preparatory
school (Classics, French, English, Mathematics) were found to
correlate highly with one another, and when, with the inclusion
of pitch discrimination and music, they were arranged in rows
and columns in order, from left to right and from above downwards,
of the average degree of the correlation of each with all
the rest, they were found to form a hierarchy such that every 
coefficient was greater than any to the right of it (in the same 
row) or below it (in the same column). The order was Classics, 
French, English, Mathematics, Pitch Discrimination, Music. 
Spearman takes this to be one form of evidence for the unity 
of the intellective function, defined as follows :—“ Wherever 
branches of intellectual activity are at all dissimilar, then their 
correlations with one another appear wholly due to their being 
all variously saturated with some common fundamental function 
(or group of functions)*.” He considers that by the use of his
formula for the correction of observational errors he can demonstrate
the same thing (the existence of a “central function”)
and can in particular show “that the common and essential
element in the intelligences wholly coincides with the common
and essential element in the sensory functions*.” The method
of proof is as follows:

* Am. J. P. Vol. XV. op. cit. p. 278.


Let $x_1$, $x_2$ be two distinct measures of sensory discrimination,
and $y_1$, $y_2$ two distinct measures of intelligence.


Then the correlation of the function common to the functions
measured by $x_1$, $x_2$ with the function common to the functions
measured by $y_1$, $y_2$ is equal to

$$\frac{\sqrt[4]{r_{x_1y_1}\, r_{x_1y_2}\, r_{x_2y_1}\, r_{x_2y_2}}}{\sqrt{r_{x_1x_2}\, r_{y_1y_2}}},$$

and if the two functions referred to are identical this expression
should be equal to unity.


In the present article, Spearman uses a simplified formula,

$$\frac{r_{xy}}{\sqrt{r_{x_1x_2}\, r_{y_1y_2}}},$$

and puts for the numerator the average of the various correlations
evaluated between the intelligences and the discriminations,
and in the denominator puts
$r_{x_1x_2}$ = the average correlation of the intellective gradings
with one another,
$r_{y_1y_2}$ = the average correlation of the gradings in discrimination
with one another,
and in this way gets results approximately equal to 1 in the
different groups tested.


One or two remarks may appropriately be made here. In 
the first place, the full formula is the only one that can be used 
with any meaning or justice since it is the only one which issues 
logically from the mathematical proof. In the second place, 
the applicability of the true formula must be considered in the 
light of its presuppositions (mentioned above, p. 83). 


$S(x_1y_1)$ should $= S(x_2y_2)$ within the limits of probable error,
and $r_{x_1-x_2,\;y_1-y_2}$ should be zero. Indeed, the assumption that δ and ε
are uncorrelated with each other or with x or y seems even
more unwarrantable here than in the case of “correcting”
coefficients, for which the formula was originally devised.

* Op. cit. p. 269.

In Spearman’s second research*, the subjects experimented 
on were eleven adults, and the measurements made were : 

(1) the spatial threshold, 

(2) addition of figures, 

(3) memory of the order of a series of figures previously seen, 

(4) combination method of Ebbinghaus (only 7 subjects 

took this test), and 

(5) discrimination of pitch. 

After corrections for disturbing but irrelevant factors it was 
found that (1) and (3) showed no correlation with each other or 
with (2), (4) and (5), but that these latter correlated highly 
with one another ($r_{24} = .70$, $r_{25} = .68$, $r_{45} = .64$), and the coefficients
could be arranged to show a hierarchical order. The raw values 
were found to give evidence of a “central factor” when tested by 
Spearman’s correction formula, though it is true that the 
arithmetic mean instead of the geometric mean was evaluated 
in the numerator (an entirely unjustifiable procedure), thus 
evading the difficulty that would have otherwise arisen owing 
to the presence in it of a zero value†. After working up some
further data, taken from Oehrn’s writings (number of subjects 
employed, 10), and finding the results in agreement with the 
hypothesis of a “central factor,” and finding too that the correla- 
tion coefficients for the same pair of abilities increase with increase 
of practice and thus contradict the view of Binet‡ that correlation
in mental tests is the result of lack of practice, and diminishes 
in size with increase of practice, Spearman concludes his paper 
with the suggestion (made “with the greatest reserve ”) that the 
central factor is a plastic function—“ plastische Funktion ”—of 
the nervous system which varies from individual to individual 
and exists as a determining influence for all the various psycho- 
physiological functions of the individual. Whatever it is, it is 
not voluntary attention (p. 106), as Binet believed it to be. 


* Krueger and Spearman: Zeitschr. f. Psych. Bd. XLIV., op. cit.

† Op. cit. p. 85.

‡ L’Année Psychologique, VI. p. 395, 1899.

§ It is perhaps hardly necessary to point out that the detailed consideration
of Dr Spearman’s work given here and elsewhere in the present book is, and is
meant to be, a tribute to the outstanding importance of that work. No one
can appreciate more vividly than does the writer the valuable suggestiveness of
all Dr Spearman’s writings, above all in respect to the special precautions
therein emphasized as requisite in measuring psychical (as distinct from
anatomical, etc.) correlation.


Spearman’s theory of an identical or absolute correspondence
between “general discrimination” and “general intelligence,”
has been recently put to the test by Messrs Thorndike, Lay and
Dean*. The subjects examined were 37 young women students
and 25 high school boys. The tests for sensory discrimination
were:

(1) drawing lines equal to given lines, and

(2) filling boxes with shot to equal in weight standard weights;

those for intelligence were:

(3) judgment of fellow-students, and

(4) judgment of teachers.


For the high school boys (3) and (4) were combined teachers’
and fellow-students’ judgments, and school marks, respectively.

In the first case, Spearman’s formula gave for the correlation
of the factor common to (1) and (2) with that common to
(3) and (4) the value 0.26 instead of 1.00. In the second case,
the value was 0.29. Moreover, Thorndike found a much higher
correlation between discrimination of lengths and discrimination
of weights than between either one of them and general intelligence,
the coefficients being

Accuracy in drawing lines, intelligence .................. 0.15,

Accuracy in making up weights, intelligence .............. 0.25,

Accuracy in drawing lines and making up weights .......... 0.50.

Thus the results were in decided conflict with both parts of
Spearman’s concluding statement “that all branches of intellectual
activity have in common one fundamental function
(or group of functions) whereas the remaining or specific
elements of the activity seem in every case to be wholly
different from that in all the others†.”


* Thorndike, Lay and Dean: “The Relation of Accuracy in Sensory
Discrimination to General Intelligence,” Am. J. P. Vol. XX. July 1909,
pp. 364-369.

† C. Spearman, Am. J. P. Vol. XV. op. cit. p. 284.

Thorndike sums up as follows: “In general there is evidence
of a complex set of bonds between the psychological equivalents 
of both what we call the formal side of thought and what we 
call its content, so that one is almost tempted to replace Spear- 
man’s statement by the equally extravagant one that there is 
nothing whatever common to all mental functions, or to any half 
of them *.” 

Quite recently (Dec. 1909) there has appeared a detailed 
account by Mr Cyril Burt† of some experiments on children,
“commenced with a view to testing in practice the mathematical
methods of Dr Spearman, and to verifying the experimental
results both of Dr Spearman and of Prof. Meumann” (op. cit.
p. 95).

Two groups of subjects were experimented on: (1) a group 
of 30 boys (age limits, 12.6–13.6) from an elementary school,
and (2) a group of 13 boys (same age limits) from a prepara- 
tory school. The mental tests employed were discrimination of 
(1) weights, (2) pitch, and (3) lengths of lines (W. H. R. Rivers’ 
adjustable apparatus used), measurement of (4) “spatial 
threshold,” (5) rate of tapping (modification of Binet’s “ petits 
points”), (6) card-dealing, (7) card-sorting, (8) alphabet-finding, 
(9) immediate memory, (10) “ mirror test,” (11) “spot pattern ” 
test (tachistoscopic: McDougall’s apparatus), (12) “dotting ” 
test (apparatus used: “dotting apparatus” of McDougall, 
devised for testing and recording continued maximal voluntary 
concentration of attention). A measure of “imputed intelli- 
gence’ was also obtained, from judgments of teachers and 
school-fellows. 

Tests (1), (2), (3) and (4) were classed as sensory tests.

Tests (5) and (6) were classed as motor tests. 

Tests (7) and (8) were classed as sensori-motor tests. 

Tests (9), (10) and (11) were classed as association tests. 

Test (12) was classed as a test of voluntary attention. 

The tests were applied to the children individually, and 
repeated at least once. The reliability coefficients were in most 


* Thorndike, Lay and Dean: op. cit. p. 368.
† Cyril Burt: “Experimental Tests of General Intelligence,” British
Journal of Psychology, Vol. III. Parts I. and II. Dec. 1909, pp. 94-177.
cases high, only those of tests 1 (preparatory school), 3, 5, 7 and
8 (preparatory school), 10, and 11 being under .6, and of these
only 7 and 8 (preparatory school) were lower than .5; so that
the average ability of each individual tested was determined
with fairly close approximation.

The sensory tests were found to correlate but slightly with 
one another (see p. 177, foot of page), and the highest correlation 
with imputed intelligence was that of pitch-discrimination in 
the elementary school, which = 0.40 ± 0.10. The remaining tests
correlated much more closely with one another and with imputed 
intelligence. The chief general result of importance which the 
author considers himself to have demonstrated conclusively is 
that the coefficients form a “hierarchy” (see p. 87 above) in 
both groups of subjects tested. Considering this result due to 
the existence of a “central factor,” he suggests that the central 
factor indicated is “ voluntary attention,” since the dotting test 
(no. 12) heads the hierarchy in both cases, and also “the intelli- 
gence coefficients for successive series with the same test” 
diminish from the first onwards (except in the case of tests 5
and 11). 
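
The probable error attached to the pitch-discrimination coefficient above can be checked with a short sketch. It assumes the classical large-sample formula P.E. = 0.6745 (1 − r²)/√n, which is not stated in this passage; the function name is illustrative only.

```python
import math


def probable_error_of_r(r: float, n: int) -> float:
    """Probable error of a correlation coefficient r based on n cases.

    Assumption: the classical large-sample formula
    0.6745 * (1 - r^2) / sqrt(n) applies.
    """
    if n <= 0:
        raise ValueError("n must be positive")
    return 0.6745 * (1.0 - r * r) / math.sqrt(n)


# Elementary-school group: r = 0.40 from 30 boys.
print(round(probable_error_of_r(0.40, 30), 2))
```

Under that assumption the 30 boys of the elementary school give a probable error of about 0.10, in agreement with the value quoted.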


This conclusion contradicts Spearman’s view*, and coincides 
with the view of Binet†.

Psychologically the research is excellent. The tests are in 
many instances original, and the possible psychological factors 
involved in each are debated with exhaustive completeness. 
With regard to the “ statistical” side of the work, I should like 
to make a few very brief remarks: 

(1) The number of individuals tested in the preparatory 
school (13) is far too small. The size of probable errors is not
known for such small numbers, though it must be large. Again, 
the two groups tested are not comparable to one another. It 
is a pity that more groups of each kind of school were not 
examined, in order to obtain results that would be comparable 
and would serve as “controls” of one another. 


* See above, p. 89.
† L’Année Psychologique, VI. p. 395. See also above, p. 89.

(2) It is a pity that Spearman’s formula, $r = \sin\left(\frac{\pi}{2}R\right)$,
should have been employed (see above, pp. 71-76). In particular,
the proof of the probable error of this value given at the
foot of p. 109 is incorrect. The true value of the P.E. of $r$
calculated from $R$ (for zero value of the coefficient) is
[value illegible in source], not $\frac{.6745}{\sqrt{n}}$ (see above, p. 73).

(3) It is very doubtful whether the results do prove the
existence of a “hierarchical” order. The orders for the elementary
school and for the preparatory school do not correspond at all
closely, the rank-correlation, ρ, working out to 0.56 only. The
plan of comparing average deviations and average P.E.’s on
p. 163 is open to serious objection (apart from the fact mentioned
in footnote 2 of the same page).

(4) The standard method for investigating the inter-relations
of the several coefficients would of course be that of
multiple or “partial” correlation*; but this method was not
employed.

(5) The probable errors of the quantities in Table VIII, 
p. 172, should have been given, and taken into consideration in 
drawing inferences from them. 
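
The rank-correlation ρ mentioned in remark (3) can be illustrated by a minimal sketch. It assumes the usual formula for untied ranks, ρ = 1 − 6Σd²/(n(n² − 1)); the two orders below are hypothetical and are not Burt's data.

```python
def rank_correlation(rank_a, rank_b):
    """Rank correlation between two orderings of the same items.

    Assumption: there are no tied ranks, so the shortcut formula
    1 - 6 * sum(d^2) / (n * (n^2 - 1)) applies.
    """
    if len(rank_a) != len(rank_b):
        raise ValueError("both orderings must rank the same number of items")
    n = len(rank_a)
    if n < 2:
        raise ValueError("at least two items are needed")
    sum_d2 = sum((a - b) ** 2 for a, b in zip(rank_a, rank_b))
    return 1.0 - 6.0 * sum_d2 / (n * (n * n - 1))


# Hypothetical positions of six tests in two "hierarchies".
order_in_school_a = [1, 2, 3, 4, 5, 6]
order_in_school_b = [2, 1, 4, 3, 6, 5]
print(round(rank_correlation(order_in_school_a, order_in_school_b), 2))
```

A value near 1 would mean the two schools place the tests in almost the same order.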


These are the few difficulties that occur to one on a rapid 
reading of the paper, though they detract but little from its
general usefulness. 

The results of a very detailed investigation into the relation- 
ship of intelligence to the size and shape of the head, and 
to other physical and mental characters, by Prof. Karl Pearson†,
appear in Biometrika, Vol. v. 1906—1907. The subjects 
measured were 1000 Cambridge graduates and considerably 
more than 5000 school children. Special care was taken in 
drawing up a quantitative scale of intelligence, adjustments 
being made so that the results fitted a “normal” or Gaussian 
distribution. The correlations were worked out in several 
different ways: correlation coefficient (r), correlation ratio (η),
coefficient of mean square contingency, and the method, first 


* See above pp. 51, 52 and 63, 64; also Appendix III.

† Karl Pearson: “On the Relationship of Intelligence to Size and Shape of
Head, and to other Physical and Mental Characters,” Biometrika, Vol. v.
1906-1907, pp. 105-146.
suggested in this paper, of the analograph. Pearson describes
this last method as follows: “In the case of intelligence, I take 
a normal scale as my base line and plot up the percentage of the 
character for each grade of intelligence along the centroid 
vertical of the corresponding range, drawing a horizontal line 
to represent the mean percentage in the population at large.
We thus obtain a diagram, which I will venture to call an 
analograph. 


“If the percentage increases or decreases continually with
intelligence (or with the base character, whatever it may be),
I term the relationship homoclinal; if the percentage does not 
reach its maximum with the maximum or minimum of intelli- 
gence, I term the diagram heteroclinal *.” 
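
Pearson's two terms can be read as a simple rule on the plotted percentages, sketched below. The function name, the "unclassified" outcome for cases his wording leaves open, and the sample figures are all assumptions made for illustration.

```python
def classify_analograph(percentages):
    """Classify a sequence of percentages, one per grade of intelligence.

    "homoclinal"   : the percentage rises or falls continually;
    "heteroclinal" : the maximum lies at neither end of the scale;
    "unclassified" : anything else (an assumption; the quoted
                     definition does not cover such cases).
    """
    if len(percentages) < 2:
        raise ValueError("at least two grades are needed")
    steps = [b - a for a, b in zip(percentages, percentages[1:])]
    if all(s > 0 for s in steps) or all(s < 0 for s in steps):
        return "homoclinal"
    peak = percentages.index(max(percentages))
    if 0 < peak < len(percentages) - 1:
        return "heteroclinal"
    return "unclassified"


# Hypothetical percentages for four grades of intelligence.
print(classify_analograph([12, 18, 25, 31]))
print(classify_analograph([12, 30, 22, 15]))
```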


The mental characters investigated (which alone interest us 
here) were—temper, popularity, self-consciousness, shyness, 
conscientiousness, quiet habits, and handwriting, and in the case 
of the school children the correlations of intelligence with these, 
and also with age, were found to be as follows: 


Character              Boys    Girls
Conscientiousness      .46     .43
Handwriting            .28     .30
Popularity             .22     .30
Temper                 .19     .22
Shyness                .03     .18
Self-Consciousness     .10     .03
Age                    .05     [illegible]
Quiet Habits           .04     .09

The low value of the correlation of intelligence with age 
should be noted. 

Space does not allow of further consideration of Karl Pear- 
son’s paper. It is undoubtedly the most important contribution 
that has hitherto been made to the subject of the relation of 
intelligence to other mental and physical characters in the 
human organism. 

Other articles dealing with psychological classifications and 
measurements, which have recently appeared in Biometrika, are 

* Op. cit. p. 120.
two accounts* of the results of applying the latest statistical
methods to the working up of material collected by certain 
psychologists. In the case of the first of these two, viz., the 
research of Messrs Heymans and Wiersma into the inheritance 
of psychical characters, the material had been obtained by the
questionnaire method, 90 questions in all being sent out, divided 
into six classes according to the nature of the characters to 
which they referred. Examples of the questions are: 


Class I. Movements and occupations (Q. 1-8), mobility or
restfulness ; steady or temporary industry or laziness; etc. 


Class V. Inclinations (Q. 44-81), use of alcohol; vanity;
ambition; love of truth; etc.


More than 400 sets of answers were returned. 


In cases where the questions involved only two alternative 
answers, the four-fold table method was used in evaluating the 
correlation between parents and children; where four or five 
alternative answers were demanded, the contingency method 
was used, and the result tested by a four-fold division of the 
table. The results were corrected for assortative mating by 
means of the correlation coefficient between father and mother. 
(In the tables for coefficient of assortative mating, the parents 
were weighted with the number of their children.) The co- 
efficients of assortative mating were found to be more frequently 
negative than positive. The account in Biometrika concludes 
as follows: “After allowing for the effect of assortative mating, 
the mean coefficients of resemblance between fathers and sons, 
and between mothers and daughters, calculated for one set of 
characters by the contingency method and for another set by 
the four-fold correlation method, come in each case to very 


* (a) “The Inheritance of Psychical Characters,” being further Statistical
Treatment of Material Collected and Analysed by Messrs G. Heymans and
K. Wiersma: by Edgar Schuster and E. M. Elderton, Biometrika, Vol. v. 1906-
1907, pp. 460-469.

(b) “On the Association of Drawing with other Capacities in School
Children,” by E. M. Elderton; being a biometric evaluation of the results of
E. Ivanhoff: “Recherches expérimentales sur le Dessin des Ecoliers de la
Suisse Romande,” Archives de Psychologie, 111. vit. 1908. Biometrika, Vol. VII.
pp. 222-226.
nearly ⅓, the value proposed originally for the parental inheri-
tance coefficient by Sir F. Galton” (p. 469). 

In the second case, that of Ivanhoff’s data in his investiga- 
tions into the relation of drawing to other mental abilities in 
school children, Miss Elderton evaluated the results by the 
contingency method (nine-fold grouping, divisions being “fort,” 
“moyen,” and “faible”) and, in some cases, by the four-fold
table method also. The conclusions she was able to draw were 
briefly as follows : 


(1) Drawing correlates with other abilities more closely in 
girls than in boys. 

(2) The correlations in all cases are extraordinarily low. 
Miss Elderton suggests that “this may arise possibly from 
capacity in drawing being a hereditary character, having small 
association with other school measures of fitness.” 

(3) General ability shows the highest of the correlations, in 
both sexes. 

(4) The so-called “psycho-pedagogic characters” (attention,
obedience, industry, cleanliness, and temper) correlate less
closely than proficiency in the ordinary subjects of the school 
curriculum do. In the case of the girls, they are all the lowest 
on the list. 

Accounts of other investigations into the relationship 
existing between the various subjects of the school and college 
curriculum are to be found in papers by W. P. Burris, 
A. G. Smith, Brinckerhoff, Morris and Thorndike*. The results 
are somewhat conflicting, and in every case surprisingly low. 
Thorndike and Fox† found low correlations between part-capacities
of the same subject (arithmetic); and Burris
obtained a coefficient for algebra and geometry which was
nearly as low as that between mathematics and a non-mathematical
subject. I have myself found‡ that, after correction for
effects of irrelevant conditions, algebra and geometry show hardly


* All in Vol. XI. of Columbia Contributions to Education.

† Op. cit. pp. 138-143.
‡ William Brown: “An Objective Study of Mathematical Intelligence,”
Biometrika, Vol. VII. Part III. 1910.
any correlation at all; in fact, the partial coefficient for
“arithmetical ability constant” is zero.
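
How a partial coefficient can vanish although the crude coefficient does not may be seen from a small sketch. It assumes the usual first-order formula r₁₂.₃ = (r₁₂ − r₁₃r₂₃)/√((1 − r₁₃²)(1 − r₂₃²)); the three crude coefficients are hypothetical and are not those of the investigation cited.

```python
import math


def partial_correlation(r12: float, r13: float, r23: float) -> float:
    """Correlation of variables 1 and 2 with variable 3 held constant."""
    denominator = math.sqrt((1.0 - r13 ** 2) * (1.0 - r23 ** 2))
    if denominator == 0.0:
        raise ValueError("r13 and r23 must be numerically less than 1")
    return (r12 - r13 * r23) / denominator


# Hypothetical crude coefficients:
#   algebra-geometry 0.30, algebra-arithmetic 0.60, geometry-arithmetic 0.50.
print(round(partial_correlation(0.30, 0.60, 0.50), 2))
```

With these figures the whole of the crude correlation of 0.30 is accounted for by the common dependence on the third variable, and the partial coefficient is zero.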


An experimental investigation of the inter-relations of some 
of the higher mental processes in 50—90 normal school students 
was made by A. Peterson* in 1908. The marks used were 
good, medium and poor, and the following are some representa- 
tive coefficients selected from those obtained : 


Reasoning and generalizing   .95     Generalizing and memory     .40
Reasoning and memory         .40     Generalizing and accuracy   .28
Reasoning and accuracy       .45     Memory and accuracy         .31


As one further illustration of the use to which the theory of
correlation has been put in Educational Psychology, mention
may be made of an investigation by Thorndike† into the correlation
between entrance marks and later performances at
college. For total marks the coefficients were:

Freshman year, .62; Sophomore year, .50; Junior year, .47;
Senior year, .25.

Taking the various subjects separately, Thorndike found a 
slight superiority over the rest in the case of science, and a 
slight inferiority in the case of mathematics. The number of 
cases worked on was 56—84. 


In a recent short paper‡, Thorndike brings forward correlation
results which he considers to be “very strong evidence of
the dependence of efficiency of memory upon content and of
the specialisation of mental functions in general.” On the
other hand, he finds that “the relation between retention of
the effects of an experience for one or two minutes and their
retention for one or two days seems to be one of the closest yet
measured in human nature.” The “raw” correlation was .54 ± .1
(n = 40), which when corrected for mixture of sexes and
inaccuracy of measurement gave a final value of .8 ± .1.


* A. Peterson: “Correlation of Certain Mental Traits in Normal School
Students,” Psychological Review, Vol. xv. 1908, pp. 323-338.

† E. L. Thorndike: “An Empirical Study of College Entrance Examinations,”
Science, N.S. Vol. xxi. June 1906, pp. 839-845.

‡ E. L. Thorndike: “The Relation between Memory for Words and Memory
for Numbers, and the Relation between Memory over Short and Memory over
Long Intervals,” American Journal of Psychology, Vol. XXI. 1910, pp. 487, 488.
CHAPTER III
SOME EXPERIMENTAL RESULTS* 


The research to be described in this chapter was devised for the 
purpose of determining to what extent correlation exists between 
certain very simple mental abilities in cases where the individuals 
experimented upon are, as near as may be, identically situated 
with respect to previous practice, general training, and environ- 
ment; and how closely, if at all, these elementary abilities are 
related to general intellectual ability as measured by teachers’ 
judgments, school marks, etc. Every effort was made to keep
the groups of individuals tested as homogeneous as possible ; 
and instead of measuring irrelevant factors and “correcting ” 
for them in the later stages of the research, the influence of 
such irrelevant factors was excluded right from the beginning 
by a rigorous segregation of the material, and in other ways. 

The groups of individuals to which the tests were applied, 
were as follows : 

Group I, 66 boys of a London elementary school, all between 
the ages 11 and 12. 

Group II, 39 girls of a London elementary school, all between
the ages 11 and 12. 

Group III, 40 boys of a London higher grade school, all 
between the ages 11 and 12. 

Group IV, 56 training college students (women), of the same 
year and of approximately the same age. 

Group Va, 35 university students (men). 

[Group Vb, 23 university students (women).]


* Published in the British Journal of Psychology, Vol. III. Pt. 3, Oct. 1910.
A few unimportant alterations have been made, and a summary of the results 
has been added. 
Little need be said as to the nature of the groups. Group
III was as homogeneous as could possibly be expected or desired.
The individuals were not only of the same age, but also belonged 
to the same form and had all worked for months past under 
exactly the same environment (same teacher, etc.). They were, 
however, a rigorously selected class, as might be expected from 
the character of the school. 


Group IV was also thoroughly homogeneous. During an 
entire year previous to the application of the tests they had 
lived under exactly the same environment. 


In Group II there was a slight mixing of “standards” which 
introduced a slight degree of heterogeneity, but the effect of this 
on the results must have been very small. 


Group I was also slightly heterogeneous owing to mixture of 
standards, and the results show that the effect of this was some- 
what greater than in the preceding case. 


Group Va was fairly homogeneous, but was of course a 
“selected” group. The same remarks apply to Group Vb, but 
owing to the smallness of the numbers (23) tested, the results
were worked out by the method of ranks (p), which was con- 
sidered good enough under such circumstances, and they are 
recorded avowedly as mere approximations. 

Other groups of school children were also tested, but as the 
marking of the results is not yet complete, no further reference 
will be made to them here, except in the case of the “addition 
tests,” for which see Appendix II, note at top of first page. 


As regards the tests employed, they were chosen not so much 
for their novelty (though a few of them are new and the 
methods of applying the tests were determined in every case 
entirely by the requirements of the circumstances), nor so much 
for their a priori likelihood of showing inter-correlation, as for
their convenience in admitting of application to an entire group
of subjects simultaneously and unobtrusively. The following is
a list of them: 


1. Crossing through letters e and r in a page of print. 


2. Crossing through letters a, n, o, and s in a page of
print.

3. Crossing through every letter in a page of print.

4. Adding up single digits in groups of ten. Measurement
of (a) speed, (b) accuracy.

5. Bisecting ten printed lines (80 mm. long), and putting
in one of the points of trisection in each of ten other lines
(90 mm. long).

6. Müller-Lyer Illusion. Measurement of (a) size, (b) mean
variation.

7. Vertical-Horizontal Illusion. Measurement of (a) size,
(b) mean variation.

8. Mechanical Memory (permanent), tested by means of 
nonsense syllables. 


9. Memory for poetry. 

10. Combination test (Ebbinghaus). 

In the case of Groups II and III, recourse was also had to: 
11. Marks for Drawing. 

12. Total School Marks. 


13. Grading for General Intelligence (two independent 
measures). 


Finally, with Groups Va and Vb, the following test was also
employed : 

14. Association-time (uncontrolled). Measurement of rate 
of sequence of ideas called up by a stimulus-word. 

The performances of the several groups in these tests admit 
of comparison in terms of the mean, standard deviation, and 
coefficient of variation, provided that the probable errors of these 
constants are also evaluated. 
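
The three constants and their probable errors might be evaluated as in the sketch below. The probable-error formulae used (0.6745σ/√n for the mean, 0.6745σ/√(2n) for σ, and 0.6745·V/√(2n)·√(1 + 2(V/100)²) for the coefficient of variation V) are the standard large-sample ones and are assumed here, not quoted from this chapter; the marks are hypothetical.

```python
import math


def frequency_constants(scores):
    """Mean, standard deviation, coefficient of variation (per cent),
    each paired with its probable error.

    Assumptions: sigma is computed with divisor n, and the classical
    large-sample probable-error formulae apply.
    """
    n = len(scores)
    if n < 2:
        raise ValueError("at least two scores are needed")
    mean = sum(scores) / n
    sigma = math.sqrt(sum((x - mean) ** 2 for x in scores) / n)
    cv = 100.0 * sigma / mean
    pe_mean = 0.6745 * sigma / math.sqrt(n)
    pe_sigma = 0.6745 * sigma / math.sqrt(2 * n)
    pe_cv = 0.6745 * cv / math.sqrt(2 * n) * math.sqrt(1 + 2 * (cv / 100) ** 2)
    return {
        "mean": (mean, pe_mean),
        "sigma": (sigma, pe_sigma),
        "cv": (cv, pe_cv),
    }


# Hypothetical marks of eight subjects in one test.
for name, (value, pe) in frequency_constants([2, 4, 4, 4, 5, 5, 7, 9]).items():
    print(f"{name}: {value:.2f} +/- {pe:.2f}")
```

Two groups may then be said to differ in a constant only when the difference is large compared with the probable errors concerned.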

With the exception of test (9), and, in some cases, of test 
(8), every test was applied twice, the second test being given 
about a fortnight after the first, and at the same hour of the 
day. In the case of the school-children, I myself applied both 
tests in the presence of the form-master or mistress. The 
adults whose measurements are recorded and employed in the 
present research were also tested, with hardly an exception, by 
myself. It should be added that the research commenced with 
a very much larger number of adults (university students and 
others), amounting to over 100, but the much smaller numbers
recorded as Groups Va and Vb were alone used for the evalua- 
tion of coefficients, since they alone displayed sufficient relia- 
bility and homogeneity for the purpose. 


(1) ER Test. 


Pages of French words, arranged in irregular order so that
they did not “make sense,” and so chosen that the number of
e’s and r’s was approximately constant from line to line, were
employed. The page was given out face downwards, and at a
given signal the subject turned it over and proceeded to cross
through every e and every r that he came to, beginning with
the first line and moving down line by line, until he received 
the signal to stop. The time allowed for the test was 5 minutes 
in the case of the children and 3 minutes in the case of the 
adults. The subject was urged to avoid passing over any of 
the stated letters but otherwise to work as quickly as possible. 
Before the commencement of the test, a full explanation of it 
was given to the group, illustrated by examples on the black- 
board. This was done in the case of every test. A different 
set of words was employed in the second test. 


System of marking : 
1 mark for every letter crossed through correctly ; 


—1 mark for every letter passed over or crossed through 


incorrectly. 
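
The system of marking can be put as a short sketch. The representation of a marked page (a string, a set of crossed positions, and the point reached when the signal to stop was given) is hypothetical, and it is assumed that letters lying beyond the point reached are not penalised.

```python
def mark_cancellation(text, crossed, reached, targets=("e", "r")):
    """Score one cancellation page.

    +1 for every target letter crossed through;
    -1 for every target letter passed over, and for every
       other character crossed through by mistake.
    Only the first `reached` characters are considered (assumption).
    """
    score = 0
    for position, character in enumerate(text[:reached]):
        is_target = character.lower() in targets
        if position in crossed:
            score += 1 if is_target else -1
        elif is_target:
            score -= 1
    return score


# Hypothetical fragment: the subject stopped after "le rire ",
# crossed e, r, i (wrongly) and e, and passed over one r.
print(mark_cancellation("le rire dure", crossed={1, 3, 4, 6}, reached=8))
```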
Results of ER Test.

                                    Coefficient    Reliability        Rel. coeff. ($r_2$) for
Group     Mean         σ            of             coefficient        amalgamated results of
                                    Variation      ($r_1$) for        tests, $r_2 = \frac{2r_1}{1+r_1}$ †
                                                   each test
I         [illegible]  68±4         [illegible]    .60                .75
II        362±8        [illegible]  [illegible]    .65                [illegible]
III       417±6        57±4         [illegible]    [illegible]        .86
*V (a)    204±4.5      41±3.3       20±1.7         .97                —


* One test only.
† $r_2$ measures the extent to which the amalgamated results of the two tests
(2) ANOS Test.


The method of procedure was identical with that described 
above for test (1), except that the letters to be crossed through 
were of four kinds instead of two, and that the time allowed 
was 5 minutes for children and also for adults. 


Results of ANOS Test.

                                    Coefficient    Reliability    Reliability
Group     Mean         σ            of             coefficient    coefficient
                                    Variation      $r_1$          $r_2$
I         161±5        58±3.5       36±2.4         [illegible]    .87
II        199±5.6      52±3.3       [illegible]    .84            .91
III       [illegible]  [illegible]  24.4±2.0       .81            .89


(3) Motor Test. 


In this test the subjects were asked to cross through every 
letter in a page of printed French words. Time allowed in all 
cases—three minutes. Method of procedure otherwise identical 
with that for (1) and (2). 


would correlate with a similar amalgamated series of two other applications of
the same test. If $x_1$, $x_2$, $x_1'$, $x_2'$ be two pairs of results ($x$ denoting, as usual,
deviation from the mean value), we may assume that

$$\sigma_{x_1} = \sigma_{x_2} = \sigma_{x_1'} = \sigma_{x_2'} = \sigma_x \ \text{(say)},$$

and that

$$S(x_1 x_2) = S(x_1 x_1') = \dots = n\sigma_x^2 r_1.$$

Hence we get

$$r_2 = \frac{S(x_1 + x_2)(x_1' + x_2')}{n\,\sigma_{x_1+x_2}\,\sigma_{x_1'+x_2'}} = \frac{4n\sigma_x^2 r_1}{n(2\sigma_x^2 + 2r_1\sigma_x^2)} = \frac{2r_1}{1+r_1}. \qquad \text{Q. E. D.}$$

It is easily seen that the amalgamation of 4 tests gives a reliability
coefficient $= \frac{4r_1}{1+3r_1}$; and, in general, for $n$ tests we have

$$r_n = \frac{nr_1}{1+(n-1)r_1}.$$

This last formula furnishes a ready means of determining, from the reliability
coefficient of a single test, the number of applications of the test which would be
necessary to give an amalgamated result of any desired degree of reliability.
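
The use of the general formula in both directions may be sketched as follows. The function names are illustrative; the second function simply solves $r_n \geq$ target for $n$, which gives $n \geq \frac{\text{target}\,(1 - r_1)}{r_1 (1 - \text{target})}$.

```python
import math


def amalgamated_reliability(r1: float, n: int) -> float:
    """Reliability of the amalgamated result of n applications of a
    test whose single-application reliability coefficient is r1."""
    if n < 1:
        raise ValueError("n must be at least 1")
    return n * r1 / (1.0 + (n - 1) * r1)


def applications_needed(r1: float, target: float) -> int:
    """Smallest number of applications giving reliability >= target."""
    if not (0.0 < r1 < 1.0 and 0.0 < target < 1.0):
        raise ValueError("r1 and target must lie strictly between 0 and 1")
    exact = target * (1.0 - r1) / (r1 * (1.0 - target))
    # The small tolerance guards against floating-point overshoot.
    return max(1, math.ceil(exact - 1e-12))


print(round(amalgamated_reliability(0.60, 2), 2))
print(applications_needed(0.60, 0.90))
```

A single-test coefficient of .60 thus rises to .75 for two applications, the pair of values recorded for Group I in the ER test, and would need six applications to reach .90.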




Results of Motor Test.

                                    Coefficient    Reliability    Reliability
Group     Mean         σ            of             coefficient    coefficient
                                    Variation      $r_1$          $r_2$
I         718±13       148±9        20.6±1.3       .91            .95
II        [illegible]  145±12       20.5±1.7       .85            .92
III       [illegible]  [illegible]  [illegible]    .76            .86


(4) Addition Test. 


Duplicates of pages from one of Kraepelin’s Rechenhefte
were used, adapted to the purpose by the printing of short
horizontal lines below each tenth figure, and by the omission of
all figures below the thirtieth in each column. Time allowed
for each test: five minutes. The speed of addition [4 (a)] was
measured by the number of sums (groups of ten digits) worked
in the given time, the accuracy of addition [4 (b)] by the percentage
of correct answers. In the case of the children, 4 (a) was
measured by the number of digits added, marks being allowed
for the part of a sum with which they usually ended.


Results of Addition Test.

                      Mean                          σ
Group     Speed         Accuracy      Speed         Accuracy
I         [illegible]   [illegible]   67±4          [illegible]
II        [illegible]   149±4         52±4          [illegible]
III       287±9         171±1.4       79±6          13.3±1
*IV       55±1.5        [illegible]   16±1.0        16.8±1.1
*V (a)    [illegible]   184±1.2       19±1.5        [illegible]
V (b)     —             —             —             —


* One sum of 10 digits taken as unit; in other cases, the number of digits
added was taken as the measure.
Results of Addition Test (cont.).

          Coefficient of Variation      Reliability coefficient      Reliability coefficient
                                        $r_1$                        $r_2$
Group     Speed         Accuracy        Speed         Acc.           Speed         Acc.
I         28±1.8        19.3±1.2        .82           .33            .90           .50
II        24.6±2.0      22.4±1.3        .69           .46            .82           .63
III       27±2.2        8±.6            .68           [0]            [illegible]   —
*IV       30±2.9        9.6±.86         .93           .29            .96           .45
*V (a)    32±2.8        [illegible]     [illegible]   .22            [illegible]   .36
V (b)     —             —               [illegible]   [illegible]    .99           .74

(5) Bisection and Trisection of Lines. 


Every test paper contained ten printed lines, each 8 cm. 
long, for bisection; and ten printed lines, each 9 cm. long, for 
trisection. The lines were printed three in a row, and those
situated immediately under others were shifted a little to one 
side. It was very certain, however, that the bisection or 
trisection of any one of the lines was influenced by the positions 
of the neighbouring lines and of the edges of the paper. This 
fact diminishes the value of the test, but does not of itself 
deprive the test of all use as a measure of one form of sensory 
discrimination. A more serious drawback was found to be the 
very great individual variability displayed, which made the 
reliability coefficients very low. In order to get a more 
reliable measure for Group V the results for bisection and
trisection in both tests (i.e. of the division of 40 lines in all by 
each individual) were thrown together and the total taken as a 
measure of sensory discrimination. Only one point of trisection 
was asked for, this being put in alternately towards the left and 
the right ends of the successive lines. Trisection was done 
very unsatisfactorily by the school-children, but, in the case of 
adults, gave a higher measure of reliability than did bisection. 


* One sum of 10 digits taken as unit; in other cases, the number of digits 
added was taken as the measure. 




The average crude error was taken as the measure of in- 
accuracy, since it was found to give more concordant results 
(higher reliability coefficients) than the other possible way of 
measuring the inaccuracy, viz. the mean variable error. 


Results of Bisection and Trisection.

                                                                  Coeff. of    Rel.           Rel.
Test                        Group          Mean      σ            Variation    coeff. $r_1$   coeff. $r_2$
Bisection only              I              311±11    129±8        41±2.8       .35            .52
Bisection (B.) and          [illegible]    —         —            —            B. .36         B. .63
Trisection (T.)                                                                T. [illegible] T. [illegible]
Bisection (B.) and          V (b)          —         —            —            B. .44         B. .93
Trisection (T.)                                                                T. .87         T. .82


(6) Müller-Lyer Illusion.
The adjustable apparatus, designed by Dr W. H. R. Rivers, 


was used to measure the size of this illusion. The length of 
the standard line was 75 mm., and results recorded below are 
also in mm. Each child of Groups II and III was tested 
individually by myself, being asked to make ten adjustments of 
the apparatus, alternately lengthening and shortening the 
variable line, and having the standard line alternately to the 
right and to the left. To obtain the reliability coefficient, the 
results were divided into two halves and correlated. The 
average deviation was taken as the size of the illusion, and the
mean variation (M.V.) was also determined. Ten subjects of
Group IV were tested, 10 times each, by one of the mistresses
of the college. Groups V (a) and V (b) were tested by myself,
but four times only.

The present test was the only one employed which involved 
the use of apparatus or the testing of the subjects separately. 
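
The procedure of dividing each subject's adjustments into two halves and correlating them may be sketched as below. How the halves were formed is not stated, so the split into first and second half is an assumption, as are the sample readings; the step from $r_1$ to $r_2$ uses the formula $r_2 = 2r_1/(1+r_1)$ given in the footnote to the ER test.

```python
import math


def pearson_r(xs, ys):
    """Product-moment correlation of two equally long series."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    s_xx = sum((x - mean_x) ** 2 for x in xs)
    s_yy = sum((y - mean_y) ** 2 for y in ys)
    return s_xy / math.sqrt(s_xx * s_yy)


def split_half_reliability(adjustments_per_subject):
    """Return (r1, r2): r1 correlates the means of the first and second
    halves of each subject's adjustments (assumed split); r2 is the
    coefficient for the two halves amalgamated."""
    first_half, second_half = [], []
    for series in adjustments_per_subject:
        half = len(series) // 2
        first_half.append(sum(series[:half]) / half)
        second_half.append(sum(series[half:]) / (len(series) - half))
    r1 = pearson_r(first_half, second_half)
    return r1, 2.0 * r1 / (1.0 + r1)


# Hypothetical illusion sizes (mm.), four adjustments per subject.
readings = [
    [17, 18, 16, 19],
    [14, 15, 15, 13],
    [20, 19, 21, 22],
    [12, 13, 14, 12],
]
r1, r2 = split_half_reliability(readings)
print(round(r1, 2), round(r2, 2))
```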
Results of the Müller-Lyer Illusion Test.

                                                                                    Rel. coeff. for
                  Mean                   σ                Coeff. of Variation       size of illusion only
Group     Size         M.V.       Size       M.V.        Size        M.V.          $r_1$    $r_2$
II        17.3±.31     3.3±.14    2.8±.22    1.3±.10     16.2±1.3    39±3.4        .65      .79
III       16.8±.4      3.1±.12    3.8±.3     1.1±.08     22.5±1.8    36±3.1        .86      .92
IV        13.7±1.11    2.3±.18    5.3±.8     .8±.13      38±6.5      36±6.1        .76      .86
*V (a)    16±.67       [remaining entries illegible]
*V (b)    [entries illegible]


(7) Vertical-Horizontal Illusion. 


The material for this test consisted of a set of 10 large 
L-shaped figures clearly printed on a very large sheet of paper, 
which could be folded in two. Each L had unequal arms, the 
shorter being 10 cms. in length, the longer 14 cms., and the 
vertical was alternately the shorter and the longer of the two. 
These papers having been distributed, it was explained by 
means of the black-board that the task to be performed was to 
mark off a part along the longer arm (estimated from the 
angle) such that it seemed equal to the shorter arm, the 
subject limiting his attention strictly to each figure in turn, 
and estimating by eye only. When all the ten figures had been 
marked in this way the subjects were asked to go over them 
once more, altering those which seemed too long or too short, 
the object of this being to make sure of the full effect of the 
illusion (of the existence of which, by the way, not one of the
subjects tested, in Groups I, II, III and IV, was aware). A
fortnight later the test was repeated with other papers. The
average size and the M.V. of the illusion were evaluated as in
(6) above. Two objections may be made to this method of
applying the test: (1) the presence of surrounding L’s influenced
the judgment; this was partly obviated by the way
the figures were arranged on the page, and I believe the 


* One test only. 
influence was actually very small, each L being large enough to
exclusively rivet the attention of the subject upon itself in its 
turn; (2) the eyesight of the subjects of the experiment was 
not previously tested. This objection is much more serious. 
Even if the illusion is not to be entirely explained as the effect 
of astigmatism, the latter must play an important part in 
determining the result. All we can say, then, is that the test 
measures the balance of effect of the various factors contributing 
towards the falsifying of judgments comparing horizontal and 
vertical distances. A somewhat remarkable result, which I do 
not remember to have heard or seen reported before, is that 
with as many as 20 measurements of each subject, quite a 
large proportion of the subjects show a negative illusion, i.e. they
underestimate the vertical instead of overestimating it. One 
might retort that this is simply a case of over-correction, were it 
not for the still more remarkable fact that in the case of all the 
children measured, the proportion is exactly ⅓: in Group I, 22
out of 66; in Group II, 13 out of 39; in Group III, 13 out of 40.
In Group IV the second test has unfortunately not yet been
marked; for the first test alone the proportion is ½.


Results of Vertical-Horizontal Illusion Test.

                  Mean                       σ                  Rel. coeff. $r_1$    Rel. coeff. $r_2$
Group     Size          M.V.        Size          M.V.         Size     M.V.        Size     M.V.
I         [illegible]   3.2±.11     [illegible]   1.3±.08      .69      .43         .82      .60
II        [row largely illegible]
III       [row largely illegible]
(8) Mechanical Memory Test.


In this test a printed list of ten nonsense syllables was 
placed face downwards before each of the subjects, and at a 
given signal the subjects turned the papers over and applied 
themselves to the learning of the syllables as intently as 
possible. On a second signal, 2—3 minutes later, the papers 
were once more turned face downwards, and collected by the 
experimenter. The subjects were then asked to think no more 
about the syllables for the present. On the following day, at 
the same hour, blank slips of paper were distributed and the 
subjects were asked to write down the syllables they had learnt 
the previous day, so far as possible in the right order. As a 
system of marking which was found to be sufficiently satisfactory 
for the purpose, 2 marks were given for each syllable right and 
in the right order, and 1 mark for each right but in the wrong 
order. The time allowed for learning was 3 minutes in the
case of Groups I and II, but this was found to be too long in
the case of Groups III and IV, who were eventually given
2 minutes and 2½ minutes respectively. On account of the
difficulty thus raised (and, in the case of Group II, for another
reason) two series of results could unfortunately only be
obtained from Groups I and IV.


Results of Mechanical Memory Test.


                        Coefficient    Reliability    Reliability
Group    Mean     σ     of Variation   coefficient    coefficient
                                           r₁             r₂
I 145+°8 O°7 OF 6/45 ‘51 68 
= 96 E°6 5°7 £43 59 +6 — -— 
aie Liss 6°8 +°53 5ST EG i — 
IV 9 ats es a | Yi is ke) 24+2 50 67 


* Results of one test only. 
(9) Memory for Poetry.


This test was applied but once, and to Groups I and II 
only. Three verses of Hood’s “ Queen Mab,” which apparently 
neither of the groups had seen or heard of before, were set to 
be learnt for 5 minutes, and the subjects were asked to attempt 
to reproduce them 24 hours later. The frequency constants 
were found to be as follows: 


— Pkoret! 

Group Mean o Coefficient of Variation 
I Q+85 | 9746 4943-7 
Et 28°6 + 100 DOs 75 ol6+2°9 


(10) Combination Test. 


This was the well-known Combinations-Methode of Ebbinghaus,
in which the subject is shown a passage of continuous
prose with from one-third to one-quarter of the words replaced
by blanks, and is asked to supply the missing words, or words
of similar significance.

In applying this test, a thorough explanation, including
black-board demonstrations and examples, was first given to
the class and the papers were then distributed face downwards.
At a given signal the class turned the papers over and proceeded
to read the passage through carefully (writing nothing)
with a view to grasping the general sense of the entire passage.
On a second signal, three minutes later, they proceeded to fill
in the blanks in order from the beginning, endeavouring to
find in each case a word which would suit the sense both of the
particular sentence in which it occurred and also of the entire
passage. This second period lasted five minutes, at the end of
which time the signal was given to stop. Such was the
method of procedure in both applications of the test in the
case of the school children, and both results were found to be
quite satisfactory.

In the case of the adults the times allowed were different, 
being 1’+ 10’ for the first passage, and 1’+ 3’ for the second, 
and the reliability coefficient for Group V was found to be 
abnormally low. As this seemed to be due mainly to the 
unsatisfactory way in which the first test was performed (I had
chosen the passage badly), the results of the second test were 
alone used for the purposes of correlation. In Group IV the 
reliability coefficient was higher, though still not very high, 
and the two series of results were therefore amalgamated in the 
usual way. 

In marking the papers, words supplied by the subject were 
counted right if they made sense in their sentence and tolerable 
sense in the entire passage, and Ebbinghaus’ system of values 
was adopted ; viz., 


Each blank filled in correctly   =  1 mark,
Each blank filled in incorrectly = −1 mark,
Each blank passed over           = −½ mark.
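
The marking scheme can be written out as a short calculation. The following is only a sketch: it takes the penalty for a blank passed over to be half a mark (an assumption about the value printed in the scheme), and the function name `combination_score` is illustrative, not taken from the text.

```python
from fractions import Fraction


def combination_score(correct: int, incorrect: int, passed_over: int) -> Fraction:
    """Score one paper of the Combination Test.

    Assumed values: +1 per blank filled correctly, -1 per blank filled
    incorrectly, -1/2 per blank passed over.
    """
    return Fraction(correct) - Fraction(incorrect) - Fraction(passed_over, 2)


if __name__ == "__main__":
    # Hypothetical paper: 20 blanks right, 3 wrong, 4 left empty.
    print(combination_score(20, 3, 4))
```

Exact fractions are used so that half-marks are never rounded when papers are totalled.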


Results of Combination Test. 


                        Coefficient    Reliability    Reliability
Group    Mean     σ     of Variation   coefficient    coefficient
                                           r₁             r₂
i | 22413 | 163+ 95 | 74463 | 74 | "85 
PL 1O7V412. | 11-44 87 |) SHG | Ee 9" ve 
III bai 8. VE IS Cie perez 84 
IV | 713421 | 24 +15 | 38433 | -46 63 
*Vi(a) | 1854 -78 | 7144. 56 | 38434 1-22 = 
Voy | i | nat an 3: Gane ee 5 




Measurements 11, 12, and 13.


Measurements 11 and 12 (marks for Drawing and Total
School Marks) need no further explanation. The grading for
General Intelligence was obtained from two of the schools
(Groups II and III), from the former of which two separate and
independent gradings, by different teachers, were provided.
These independent gradings correlated with one another to the
extent of .90, which gave a reliability coefficient r₂ for the
amalgamated grading = .95†.


* One test only (the 2nd).

† The high value of this coefficient contradicts the views of certain writers,
but is borne out by a later research by Miss H. Gertrude Jones, whose results
have been statistically evaluated by Karl Pearson, Biometrika, Vol. VII. 1910,
pp. 542-548.
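
The step from a correlation of .90 between the two gradings to a reliability of .95 for the amalgamated grading agrees with the usual relation r₂ = 2r₁/(1 + r₁) for a series formed by adding two equally reliable measurements. The sketch below assumes that relation (it is not restated in this passage); the function name is illustrative only.

```python
def amalgamated_reliability(r1: float) -> float:
    """Reliability r2 of the sum of two measurements whose mutual correlation is r1.

    Assumes the relation r2 = 2*r1 / (1 + r1).
    """
    return 2.0 * r1 / (1.0 + r1)


if __name__ == "__main__":
    # .90 is the correlation between the two independent gradings of Group II.
    for r1 in (0.90, 0.74, 0.51, 0.50):
        print(f"r1 = {r1:.2f}  ->  r2 = {amalgamated_reliability(r1):.2f}")
```

The same relation connects the pairs of reliability coefficients r₁ and r₂ that can still be read in the tables of results, for example .74 with .85 in the Combination Test.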
(14) Association-time.


This test was applied to certain individuals of Group V 
(a and b) only, and was of the following nature. A word of 
ordinary significance (a noun) was read out to the subjects and 
they were expected to write down as rapidly as possible, during 
the two minutes which followed, words representing the various 
“ideas” which passed through their mind in the time. After 
a short pause, another quite different word was called out and 
the writing repeated. Finally a third word was called out. 
The total number of words or phrases written down was taken 
as a measure of the rate of sequence of associated ideas in the 
subject’s mind. The test was repeated a fortnight later. 


This method gives fairly reliable results (for Group V(a),
r₂ = .67, and for Group V(b), r₂ = .87) but is vitiated by the
mechanical process of writing. The impurity could be eliminated
by applying a simple writing test (speed) also, and then
employing the formula for the “partial” correlation of three
variables; it was not, however, done in the present research.


General Remarks upon the Tests. 


The results tabulated in the last few pages, when tested by
means of the formula for the P.E. of a difference
[$\sigma_{x-y} = \sqrt{\sigma_x^2 + \sigma_y^2}$ and
$\therefore\ \mathrm{P.E.}_{x-y} = \sqrt{\mathrm{P.E.}_x^2 + \mathrm{P.E.}_y^2}$], show certain differences between
group and group in respect of average ability, variability and 
reliability for correlation, which justify our plan of working 
correlation coefficients separately for the several groups, but do 
not seem otherwise to give many positive results of general 
significance and importance, such as, e.g., evidence as to the 
relative variability of the two sexes. A more careful and 
thorough examination of the tables may give cause for some 
qualification of the preceding statement. At any rate the 
individual figures are of considerable interest. The reliability
coefficients, even for the single tests, are in most cases
sufficiently high, in fact much higher than I had dared to
expect considering the circumstances that in all but one test
the subjects were examined collectively. The less satisfactory
tests, as applied in this research, seem to be those for accuracy
of addition, bisection and trisection of lines, M.V. of the vertical-
horizontal illusion, and, in a slighter degree, mechanical memory.
The combination test, in the case of adults, was the most
unsatisfactory, but in the case of the school children it gave
fairly high results. The tests in which two applications gave
very reliable results are the motor test, the anos test, speed
of addition, the combination test (with children) and the er
test. The coefficients of variation are rather high, clustering
about the values 20 to 30; in a few cases they are considerably
higher.
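
The comparison of two group means by the probable error of their difference can be sketched as follows. The figures in the example are hypothetical and are not taken from the tables; the function names are likewise illustrative.

```python
import math


def pe_of_difference(pe_x: float, pe_y: float) -> float:
    """Probable error of a difference: sqrt(P.E.x^2 + P.E.y^2)."""
    return math.hypot(pe_x, pe_y)


def difference_in_pe_units(mean_x: float, pe_x: float,
                           mean_y: float, pe_y: float) -> float:
    """How many probable errors separate the two means."""
    return abs(mean_x - mean_y) / pe_of_difference(pe_x, pe_y)


if __name__ == "__main__":
    # Hypothetical group means: 20.0 +/- 0.9 and 17.0 +/- 1.2.
    print(pe_of_difference(0.9, 1.2))
    print(difference_in_pe_units(20.0, 0.9, 17.0, 1.2))
```

A difference of only about twice its probable error, as in this hypothetical case, would not ordinarily be treated as established.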

The tests were applied during the course of the summer of 
1909, and my sincere gratitude and thanks are due to the head 
masters, head mistresses and others, through whose kindness 
I was enabled to bring the research to a satisfactory conclusion. 


As a fact of considerable importance, it should be added
that the tests were so applied as to disturb the ordinary routine 
of the schools as little as possible. 


Correlation Results. 


The values of the frequency constants for the various groups
of subjects show very clearly that any plan of throwing them
together (as they stand) and working out coefficients from the
combined series would produce a considerable amount of
“spurious” correlation and make the results almost valueless.
One exception, indeed, to this state of affairs was found in the
case of Speed and Accuracy of Addition [4 (a) and 4 (b)] in Group I
and in a group of 20 boys not otherwise included in the present
research. The means and S.D.’s in these two sets of boys for
these two characteristics were found to be the same, within the
limits of probable error. These 86 boys were therefore taken
together for this particular correlation and a correlation table was
drawn up (see Appendix II and p. 59 above) whereby the value
of η could be calculated as well as that of r, and the nature of the
regression curve and regression line determined. To make the
investigation into this particular problem of speed and accuracy
in adding complete, small correlation tables were drawn up for all
the other groups, separately from one another, and the value of
η was calculated in each of these cases also (Appendix IV).
Apart from these cases the plan of grouping in correlation
tables seemed quite unsuitable for such small numbers. The
values were therefore taken as they stood, but the full product-moment
formula

$$ r = \frac{S(xy)}{N\sigma_1\sigma_2} $$

was employed throughout, with the single exception of Group V(b).
Here the numbers were so very small (23) that the method of ranks

$$ \rho = 1 - \frac{6\,S(v_1 - v_2)^2}{N(N^2 - 1)}, \qquad r = 2\sin\left(\frac{\pi}{6}\rho\right) $$

was considered sufficiently accurate, since nothing but a general
impression of the nature of the correlation could be expected.
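
The two procedures named here, the product-moment formula and the method of ranks with its conversion to r, may be sketched as follows. This is an illustration only: the function names are not from the text, x and y are measured from their means inside `product_moment`, and the ranking helper assumes there are no tied values.

```python
import math


def product_moment(x, y):
    """r = S(xy) / (N * sigma_1 * sigma_2), with x and y taken as deviations from their means."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    s_xy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sigma_1 = math.sqrt(sum((a - mx) ** 2 for a in x) / n)
    sigma_2 = math.sqrt(sum((b - my) ** 2 for b in y) / n)
    return s_xy / (n * sigma_1 * sigma_2)


def ranks(values):
    """Rank 1 for the largest value; assumes no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i], reverse=True)
    result = [0] * len(values)
    for position, index in enumerate(order, start=1):
        result[index] = position
    return result


def rank_rho(x, y):
    """rho = 1 - 6 S(v1 - v2)^2 / (N (N^2 - 1))."""
    n = len(x)
    d2 = sum((a - b) ** 2 for a, b in zip(ranks(x), ranks(y)))
    return 1.0 - 6.0 * d2 / (n * (n * n - 1))


def r_from_rho(rho):
    """r = 2 sin(pi * rho / 6)."""
    return 2.0 * math.sin(math.pi * rho / 6.0)


if __name__ == "__main__":
    x = [1, 2, 3, 4]   # hypothetical marks
    y = [1, 2, 4, 3]
    rho = rank_rho(x, y)
    print(rho, r_from_rho(rho), product_moment(x, y))
```

The conversion leaves ρ = 0 and ρ = ±1 unchanged and raises intermediate values slightly.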

The following Tables give the values of the correlation 
coefficients between series formed by the amalgamation of the 
two measurements made in each test. The numbers immedi- 
ately below the coefficients are the probable errors, and those 
in thick type are the reliability coefficients (r,) for the 
amalgamated series, showing to what extent each amalgamated 
series would correlate with another quite similar series. In 
these tables, the tests are arranged according to order of 
magnitude of the average correlation of each with all the rest 
(within any particular group) and all coefficients smaller than 
their probable errors are put down as 0. Of the coefficients 
recorded, the total number of those >2 P.E. is 139, and of those 
>3 P.E. the total is 86. The first thing to be noticed in the 
groups of coefficients arranged in this way is that not one of 
them shows the “hierarchical arrangement,” and it is a very 
significant fact that the group which approaches it most nearly
(Group I) is the group where “spurious correlation” due to 
heterogeneity of material was to be suspected (see p. 99). 
Now it will be apparent, on the slightest reflection, that any 
extraneous source of correlation (such as, e.g., difference of the 
state of discipline to which different members of the group had 
been accustomed immediately antecedent to the occasion of 
GROUP IV. (56 women students.) [Provisional
and incomplete table of coefficients.]


                Combination   Addition     Addition    Mechanical
                              (accuracy)   (speed)     Memory

Combination         .63          .53          .34         .31
                                 .06          .08         .08

Addition            .53          .45          .43         .20
(accuracy)          .06                       .07         .09

Addition            .34          .43          .96         .18
(speed)             .08          .07                      .09

Mechanical          .31          .20          .18         .67
Memory              .08          .09          .09


Variability coefficient for speed of addition:
$r_{\mathrm{sp.}_1+\mathrm{sp.}_2,\ \mathrm{sp.}_1\sim\mathrm{sp.}_2} = 0.33 \pm .08$.

Variability coefficient for accuracy of addition:
$r_{\mathrm{acc.}_1+\mathrm{acc.}_2,\ \mathrm{acc.}_1\sim\mathrm{acc.}_2} = -0.66 \pm .05$.
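
Reading the subscripts of the variability coefficient as the sum of the two performances and their difference taken without regard to sign (an interpretation of the notation, not a statement made in this passage), the coefficient could be computed as in the sketch below. The data and the function names are hypothetical.

```python
import math


def pearson_r(x, y):
    """Ordinary product-moment correlation."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    s_xy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    s_xx = sum((a - mx) ** 2 for a in x)
    s_yy = sum((b - my) ** 2 for b in y)
    return s_xy / math.sqrt(s_xx * s_yy)


def variability_coefficient(first, second):
    """Correlation of (first + second) with |first - second| over the subjects.

    Assumption: the sum stands for ability, the absolute difference for variability.
    """
    sums = [a + b for a, b in zip(first, second)]
    diffs = [abs(a - b) for a, b in zip(first, second)]
    return pearson_r(sums, diffs)


if __name__ == "__main__":
    # Hypothetical scores of four subjects on two applications of a test.
    print(variability_coefficient([1, 2, 3, 4], [1, 3, 5, 7]))   # abler subjects vary more
    print(variability_coefficient([4, 5, 6, 7], [1, 3, 5, 7]))   # abler subjects vary less
```

A positive value means that the subjects with the larger totals also differ more between the two applications; a negative value means the reverse.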


[Correlations of M. L. Ill. (10 cases only*.) 


Tuy. mn. = 9, ’'M.V. of MLL. mn. = 28 + ‘19, 
Comb. Comb. 

Te. w= oo + 18, MV of Mi. In, =— "21 £°20, 
Mech. Mem. Mech. Memory 

MyM. mn. = — ‘d7 +14] 


M.V. of M.L. Il. 


* Owing to the smallness of the sample, the P.E.’s (calculated by the
ordinary formula) are too low, and the results cannot be regarded as anything
more than very rough approximations.
GROUP V (a). (85 men students.)


Agssocia- Bisection 
tion Addition Combi- Addition M.L. 
er time (acc.) nation (speed) Tll. Trisection 
er ‘97 — 18 — 96 19 O "42 — "94 
= ts 10 “tt “09 0) 
Association —‘18 87 39 33 Oy. O 0 
time lh O09 40) “09 
Addition — ‘26 39 36 — 16 38 O 0 
(accuracy ) “10 09 et “09 
Combina- 19 33 = "16 22* 19 — 24 0 
tion “td S10) “tt “td. ‘10 
Addition 0) OT 38 ‘19 ‘97 0) “13 
(speed ) "09 ‘09 aid “dd 
M.. b. Tl. 492 O O — 94 O 5 —°99 
09 10 ‘10 
Bisection+ — ‘24 0) 0 0) “1 ‘29 B=°'53 
Trisection 0 Bi! 10 T=°52 
9 coefficients > 2 X P. BE. 5 coefficients > 3 x P. BE. 


* The second test only was used in this case for correlation with other tests
since the low correlation between the two was almost certainly due to the
unsatisfactory nature of the first. The value .22 is, then, r₁.
[GROUP V(b). (23 women students*.)


Method of ranks used:

$$ \rho = 1 - \frac{6\,S(d^2)}{N(N^2 - 1)}, \qquad r = 2\sin\left(\frac{\pi}{6}\rho\right) $$

Associa- Bisection 
M.L. tion Combi- as Addition Addition 
Til. time nation Trisection (acc.) (speed) er 
RL. TL 68 —'78 — ‘90 — °84 ="53 ="66) | —<42 
05 "03 ‘O4 10 ‘08 | 
Association — ‘78 67 58 “53 58 35 A3 
time O05 ‘O09 “10 ‘O09 yi 2 oy | 
Combina- — ‘90 "58 82 O 28 ‘3 "52 
tion "03 09 tS “14 10 
Bisection + — °84 53 O B='6l ‘29 ‘40 0 
Trisection ‘Od ‘10 T= a0 ms) "12 
Addition —°53 58 28 29 ‘74 ‘35 0) 
(accuracy ) ‘10 ‘09 "18 "13 "12 
Addition — ‘66 35 1S ‘40 3D "99 O 
(speed ) OS “2 ‘1h 12 LZ 
er ~ +42 43 52 0 0 0 58 | 
i hi 10 
16 coefficients > 2 x P.E. 12 coefficients > 3 x P.E. 


* These results are recorded as avowedly rough approximations only, owing 
to the smallness of the sample. 
applying the tests) the influence of which is in a constant
direction but varies in amount from test to test according to 
the varying degrees to which the individual tests are susceptible 
to its influence, must tend to produce the hierarchical arrange- 
ment, and unless counteracted by other more potent tendencies, 
will do so. In fact, it will be the “central factor” supposed 
to be indicated by such a form of arrangement of coefficients. 
Spurious correlation of this nature might arise from the use of 
unfamiliar apparatus in the tests, or from the novelty of the 
tests, or in many other ways. The form of procedure adopted 
in the present research was specially devised to reduce such 
extraneous sources of correlation to a minimum, being assimi- 
lated as far as possible to the ordinary class-work of the school. 

A definite solution of the question of the existence or non- 
existence of one central mental ability is yet to be sought. It 
can only be obtained by the use of much larger random samples 
than those hitherto employed, since the probable errors must be 
small compared with the coefficients, if precise inferences are to 
be drawn from the latter, and in the case of small samples, this 
condition is satisfied only for large correlation coefficients, which
when obtained are often merely the result of selecting tests 
which measure closely similar mental abilities. In all results 
hitherto quoted in support of ultimate identity of general 
intelligence and general sensory discrimination, the correlations 
contributed by the latter are so small compared with their P.E.’s 
that nothing definite can be inferred from them. On the other 
hand, in such cases it is easy to propound hypotheses since the
bounds of possibility are nowhere limited in any unambiguous 
way. 
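
The dependence of the probable error upon the size of the sample may be made concrete. The sketch below assumes the ordinary formula P.E. = 0.6745(1 − r²)/√N, which reproduces the probable errors printed beneath the coefficients for the 56 subjects of Group IV; the function names, and the choice of three probable errors as the standard, are illustrative.

```python
import math


def probable_error_r(r: float, n: int) -> float:
    """Ordinary probable error of a correlation coefficient (assumed formula)."""
    return 0.6745 * (1.0 - r * r) / math.sqrt(n)


def smallest_sample(r: float, multiple: float = 3.0) -> int:
    """Smallest N for which |r| is at least `multiple` times its probable error."""
    bound = (multiple * 0.6745 * (1.0 - r * r) / abs(r)) ** 2
    return math.ceil(bound)


if __name__ == "__main__":
    for r in (0.53, 0.34, 0.20):
        print(r, round(probable_error_r(r, 56), 2))
    # A small coefficient needs a far larger sample than a large one.
    print(smallest_sample(0.2), smallest_sample(0.6))
```

On this reckoning a coefficient of .2 needs nearly a hundred subjects before it reaches three times its probable error, while one of .6 reaches it with a handful.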

Certain sub-groups can be chosen from the tables so as to 
show a hierarchical arrangement, e.g. Group III, School Marks, 
General Intelligence, Mechanical Memory, and Combination. 
In fact the general law,—so far as the results allow of the con- 
fident formulation of any law at all—would seem to be that the 
tests fall into a number of such sub-groups, correlating highly 
among themselves, but not at all highly with members of other 
sub-groups, though an individual member of one sub-group may, 
exceptionally, correlate highly with an individual of another 
sub-group. In order to bring out these relations more clearly
and also to show the relation of the main groups with one 
another, the table on pages 122 and 123 was drawn up. The 
results there to some extent explain themselves. Differences 
in correlation between the two sexes, though well marked, do 
not seem to follow any general law. On the whole, the correla- 
tions are lower* in the girls than in the boys, higher in the 
women than in the men. A striking feature is the fairly large 
number of instances of negative correlation in the girls corre- 
sponding to positive correlations in the boys. 


Comparing relative order of tests in the tables for Groups
I, II, and III, I get:


I                 Rank    II                Rank    III               Rank
er                  1     Combination         3     Combination         3
anos                2     Mech. Memory        4     er                  1
Combination         3     anos                2     Addition (sp.)      5
Mech. Memory        4     er                  1     anos                2
Addition (sp.)      5     Motor               7     Mech. Memory        4
Addition (acc.)     6     Addition (sp.)      5     V. H. Ill.          8
Motor               7     V. H. Ill.          8     Motor               7
V. H. Ill.          8     Addition (acc.)     6     Addition (acc.)     6


The rank-correlations here are
$\rho_{\mathrm{I,II}} = .66$, $\rho_{\mathrm{I,III}} = .74$, $\rho_{\mathrm{II,III}} = .55$.
As might be expected, the two groups of boys correspond 
more closely with one another than either does with the group 
of girls. (Compare value deduced from Mr Burt’s results, 


p. 93 above.) 
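
The rank-correlation between two orderings of the same eight tests can be recomputed directly. The sketch below does so for Group I against Groups II and III, taking the orders of the tests as they can be read from the table above (a reading of a damaged table, and so an assumption); the names used are illustrative.

```python
def rank_correlation(order_a, order_b):
    """rho = 1 - 6 S(d^2) / (N (N^2 - 1)), d being the difference in position of each test."""
    pos_a = {test: i for i, test in enumerate(order_a, start=1)}
    pos_b = {test: i for i, test in enumerate(order_b, start=1)}
    n = len(order_a)
    d2 = sum((pos_a[t] - pos_b[t]) ** 2 for t in order_a)
    return 1.0 - 6.0 * d2 / (n * (n * n - 1))


# Orders of the tests as read from the table (assumed reading).
ORDER_I = ["er", "anos", "Combination", "Mech. Memory",
           "Addition (sp.)", "Addition (acc.)", "Motor", "V. H. Ill."]
ORDER_II = ["Combination", "Mech. Memory", "anos", "er",
            "Motor", "Addition (sp.)", "V. H. Ill.", "Addition (acc.)"]
ORDER_III = ["Combination", "er", "Addition (sp.)", "anos",
             "Mech. Memory", "V. H. Ill.", "Motor", "Addition (acc.)"]

if __name__ == "__main__":
    print(round(rank_correlation(ORDER_I, ORDER_II), 3))
    print(round(rank_correlation(ORDER_I, ORDER_III), 3))
```

Both values agree with the printed coefficients for these two pairs to within a unit in the second decimal place.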


The method of multiple or “partial” correlation (see pp. 52,
63, 64 above) may be very advantageously employed to investigate
the way in which the correlation coefficients are related to one
another. Thus, taking the three variables, er, anos, and motor,
and using the formula

$$ r_{12\cdot3} = \frac{r_{12} - r_{13}\,r_{23}}{\sqrt{1 - r_{13}^2}\,\sqrt{1 - r_{23}^2}} $$

I get the following values for the partial correlations in the first
three groups:


                 Group I       Group II      Group III
er, anos 80°03 "82 + 04 (6+ 04 
er, motor “a0 0D 55 +:°08 1 09 
anos, motor — "38 +°07 —'35 +°09 = 284-10 


* This may be due to greater variability.
Thus the original positive correlation between anos and
motor is due entirely to the correlation of each with er. For 
“er constant,” the correlation is large but negative, in all three 
cases. 
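
How a positive entire correlation can become negative when a third variable is held constant is easily shown numerically. The sketch below uses the three-variable formula; the coefficients in the example are hypothetical and are not those of any group in this research.

```python
import math


def partial_r(r12: float, r13: float, r23: float) -> float:
    """r_12.3 = (r12 - r13*r23) / (sqrt(1 - r13^2) * sqrt(1 - r23^2))."""
    return (r12 - r13 * r23) / math.sqrt((1.0 - r13 ** 2) * (1.0 - r23 ** 2))


if __name__ == "__main__":
    # Hypothetical values: tests 1 and 2 correlate .30 with one another,
    # and .80 and .60 respectively with a third test.
    print(partial_r(0.30, 0.80, 0.60))
    # If r12 equals r13 * r23, nothing is left once test 3 is held constant.
    print(partial_r(0.48, 0.80, 0.60))
```

In the first case the partial coefficient is negative although the entire coefficient is positive; in the second the entire correlation is wholly accounted for by the third variable.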


Employing the four variables, School Marks, General In- 
telligence, Combination and Mechanical Memory (Group IT) 
and using the formula 

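$$ r_{12\cdot34} = \frac{r_{12}(1 - r_{34}^2) - r_{13}(r_{23} - r_{24}r_{34}) - r_{14}(r_{24} - r_{23}r_{34})}{\sqrt{1 - r_{13}^2 - r_{14}^2 - r_{34}^2 + 2r_{13}r_{14}r_{34}}\ \sqrt{1 - r_{23}^2 - r_{24}^2 - r_{34}^2 + 2r_{23}r_{24}r_{34}}} $$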


(see p. 64), I find the correlation between Combination and 
General Intelligence, assuming constant ability in Mechanical 
Memory, and as shown by School Marks 

=O Ok. 


The “entire” coefficient = 0.43 ± .09. It is interesting to
note that there is still correlation, though very slight, after the 
effect of memory and school industry and ability is eliminated. 


A more thoroughgoing application of the method of partial 
correlation was made in the case of Group I, for the tests er, 
anos, Combination and Mechanical Memory. The full working, 
according to Mr G. U. Yule’s method, is given in Appendix III. 
The results were: 


                                   Entire         Partial
Tests Correlated                   Coefficients   Coefficients

er, anos                           0.78 ± .03      0.73 ± .04
er, Combination                    0.45 ± .07      0.01 ± .09
er, Mechanical Memory              0.40 ± .07      0.26 ± .08
anos, Combination                  0.48 ± .07      0.27 ± .08
anos, Mechanical Memory            0.29 ± .08     −0.15 ± .08
Combination, Mechanical Memory     0.52 ± .06      0.44 ± .07
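
The partial coefficients of this table can be checked from the entire coefficients alone. The sketch below applies the four-variable formula in its determinant form, rather than the arithmetic of Appendix III, to the six entire coefficients for Group I; the dictionary layout and function names are illustrative.

```python
import math

# Entire coefficients for Group I, as given in the table.
ENTIRE = {
    frozenset(("er", "anos")): 0.78,
    frozenset(("er", "comb")): 0.45,
    frozenset(("er", "mem")): 0.40,
    frozenset(("anos", "comb")): 0.48,
    frozenset(("anos", "mem")): 0.29,
    frozenset(("comb", "mem")): 0.52,
}


def r(a, b):
    """Entire correlation between two named tests."""
    return ENTIRE[frozenset((a, b))]


def partial_four(a, b, c, d):
    """Correlation of a with b, the tests c and d being held constant."""
    r12, r13, r14 = r(a, b), r(a, c), r(a, d)
    r23, r24, r34 = r(b, c), r(b, d), r(c, d)
    numerator = (r12 * (1 - r34 ** 2)
                 - r13 * (r23 - r24 * r34)
                 - r14 * (r24 - r23 * r34))
    root_1 = math.sqrt(1 - r13 ** 2 - r14 ** 2 - r34 ** 2 + 2 * r13 * r14 * r34)
    root_2 = math.sqrt(1 - r23 ** 2 - r24 ** 2 - r34 ** 2 + 2 * r23 * r24 * r34)
    return numerator / (root_1 * root_2)


if __name__ == "__main__":
    print(round(partial_four("er", "anos", "comb", "mem"), 2))
    print(round(partial_four("er", "comb", "anos", "mem"), 2))
    print(round(partial_four("comb", "mem", "er", "anos"), 2))
```

Worked to two places, the formula returns the six partial coefficients of the table, which serves as a check upon the reading of both columns.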


The regression equation for the calculation of ability in 
combination from abilities in the other three tests is 


Le = ‘0027 ey Pigs O99 Xsinos oF LOBPinech, memory > 

x in each case denoting deviation from mean ability.
For a larger number of variables than four, the arithmetic 
of partial correlation becomes extremely lengthy and rather 


fatiguing, but there can be no doubt whatever that this is the 
one sound method to adopt in investigating the relations 
between coefficients. Working with a large number of variables
is only satisfactory when the original coefficients are large compared
with the P.E.’s, since, as a rule (not universally*) the partial
coefficient is smaller than the entire coefficient. The formula
for the P.E. is the same in both cases.


It must also be remembered that the partial correlation
formulae assume linear regression. In the case of the correlation
of Addition (speed) and Addition (accuracy) throughout
the Groups, and in the case of Combination and Mechanical
Memory in one Group (Group I), the “correlation ratio” (η) was
determined and compared with the corresponding value for r.
The results are given in Appendix IV, together with the
regression curves. It will be observed that although the criterion

$$ \frac{\sqrt{N}}{\cdot 67449}\cdot\frac{1}{2}\sqrt{\eta^2 - r^2} $$

works out < 2.5, there seems to be a small
but well-defined deviation from linearity in all the cases of
Addition (sp. and acc.) (see, too, the regression curve in
Appendix II). η is in all cases fairly high, and the general
form of curve is approximately identical in all. With increase
of speed, the accuracy may at first decrease (Groups III and
VI), but after that, it in all cases increases, then decreases, then
increases once more, and in cases III and V (a) it finally
decreases.
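
The comparison of η with r may be illustrated on a small grouped table. The sketch below assumes that η is the standard deviation of the array means (each weighted by the number in its array) divided by the standard deviation of the whole series, and that the criterion of linearity has the form (√N / .67449) · ½ · √(η² − r²), to be compared with 2.5. The data and the function names are hypothetical.

```python
import math
from collections import defaultdict


def pearson_r(x, y):
    """Ordinary product-moment correlation."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    s_xy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    s_xx = sum((a - mx) ** 2 for a in x)
    s_yy = sum((b - my) ** 2 for b in y)
    return s_xy / math.sqrt(s_xx * s_yy)


def correlation_ratio(x, y):
    """eta of y on x, the values of x defining the arrays."""
    arrays = defaultdict(list)
    for a, b in zip(x, y):
        arrays[a].append(b)
    mean_y = sum(y) / len(y)
    between = sum(len(g) * (sum(g) / len(g) - mean_y) ** 2 for g in arrays.values())
    total = sum((b - mean_y) ** 2 for b in y)
    return math.sqrt(between / total)


def linearity_criterion(n, eta, r):
    """(sqrt(N) / .67449) * (1/2) * sqrt(eta^2 - r^2); assumed form of the criterion."""
    return math.sqrt(n) / 0.67449 * 0.5 * math.sqrt(max(eta * eta - r * r, 0.0))


if __name__ == "__main__":
    x = [1, 1, 2, 2, 3, 3]
    straight = [1, 3, 2, 4, 3, 5]   # array means lie on a straight line
    bent = [1, 1, 3, 3, 1, 1]       # array means rise and then fall
    for y in (straight, bent):
        eta, r = correlation_ratio(x, y), pearson_r(x, y)
        print(round(eta, 3), round(r, 3), round(linearity_criterion(len(x), eta, r), 3))
```

When the array means lie on a straight line η and r coincide and the criterion vanishes; when they do not, η exceeds r, though with very few observations the criterion may still fall short of 2.5.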


Variability coefficients ($r_{a_1+a_2,\ a_1\sim a_2}$) were obtained for Speed of
Addition and Accuracy of Addition in Group IV, and were
found to be
0.33 ± .08 and −0.66 ± .05, respectively.

A full analysis of the entire data in this and other additional
ways must be reserved for a later occasion.

As mentioned above, pp. 84 and 85, tests were made of the 
applicability of Spearman’s correction formula, with results 
which precluded the use of the formula. The question of 
correlation of errors of measurement with the true values of the 
variates and with one another has been made the subject of a 
separate piece of research by the present writer. The results 
of this investigation will be published shortly. 

* See, e.g., p. 52 above. 
In the case of the Vertical-Horizontal Illusion Test, it is
perhaps of interest to note that if subjects showing a negative
value of the illusion are excluded the value of the correlation
between this Illusion Test and the Combination Test becomes
positive and appreciable. In Group IV

$$ r_{\text{Combination},\ \text{V. H. Ill.}} = 0.24 \pm 0.09 $$

(one test only of V. H. Ill.); in Groups I + III, it = 0.26 ± 0.08
(this latter value probably includes some “spurious” correlation:
see tables pp. 107, 110). If negative values are included, we
have the results:

Combination with V. H. Ill.:
Group I, 0; Group II, 0.22 ± .10; Group III, −.11 ± .11.

Gen. Intelligence with V. H. Ill.:
Group I, 0; Group II, 0; Group III, −.30 ± .10.


Conclusions. 


1. The correlation between different psychical abilities is
not very close. Few coefficients are greater than .6.

2. The size of the correlation coefficient varies greatly 
from one group of subjects to another. This shows how great 
is the danger of spurious correlation, due to heterogeneity of 
material, in psychical measurements. 

3. The Combinations-Methode of Ebbinghaus is a good 
measure of intellectual ability. It correlates with “general 
intelligence” almost as closely as “scholastic intelligence ” 
(school marks) does. 

4. Mechanical memory correlates fairly closely with in- 
telligence. 

5. Drawing also correlates closely with intelligence in the 
case of higher grade school boys. On the other hand, for girls 
of the same age the correlation is nil. 

6. The susceptibility to the Müller-Lyer Illusion is not
at all closely correlated with that to the Vertical-Horizontal
Illusion. This indicates that the factors involved are for the
most part different in the two cases.

7. The “mean variation” of the Vertical-Horizontal Illusion
is positively related to the size of the illusion (Group I, .21 ± .08,
Group III, .33 ± .10).


8. In the case of the Müller-Lyer Illusion, there is a zero
or slightly negative correlation between M.V. and size.


9. The two optical illusions correlate differently with
drawing, the Müller-Lyer negatively (Group II, −.44 ± .09,
Group III, −.19 ± .10), the Vertical-Horizontal positively or
not at all (Group II, .27 ± .10, Group III, 0).


10. The mean variations of the illusions show a zero or 
very slight negative correlation with other measures of mental 
ability. 

11. The correlation between speed and accuracy of mental 
performance is slightly non-linear. 


12. Correlations may be very low even within a set of 
mental tests which appear to measure closely related mental 
abilities, and this when the reliability coefficients are high. 
Thus, the correlation between erasing the letters a, n, 0, s and 
erasing all the letters, is less than three times the probable 
error in every group tested. For “erasing the letters e, r” 
constant, the partial correlation is negative and numerically 
greater than 3 P.E. in each group. 


13. In homogeneous groups of subjects there is no positive 
evidence of the existence of one “central factor” to which the 
correlations between the individual mental abilities may be 
regarded as due. 


14. There is in several cases a pronounced correlation
between ability and variability. This may be either positive 
or negative. It is, e.g., positive for speed in adding figures, 
negative for accuracy. 


15. There is definite evidence that variability, as measured 
by the coefficient of variation, is greater in boys than in girls. 
Differences in correlation, though sometimes appreciable, seem 
to follow no well-defined rule. 


CHAPTER IV
THE SIGNIFICANCE OF CORRELATION IN PSYCHOLOGY 


The full significance of correlation in Psychology is to be 
found in the general theory of multiple correlation, of which 
the correlation of two variables, measured by the correlation 
coefficient, is only a particular case. Karl Pearson’s classic 
deduction of the product-moment formula* proceeds on these 
lines. We will therefore attempt to state the general problem, 
so that we may have something tangible on which to base our 
discussion.

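Let there be n physical (or psychical) quantities

$$ x_1,\ x_2,\ x_3,\ \ldots,\ x_n $$

which we observe and find correlated together; and let us
assume that there are m causes

$$ \epsilon_1,\ \epsilon_2,\ \epsilon_3,\ \ldots,\ \epsilon_m, $$

which may be physical, biological, or psychological, but which
are absolutely independent of one another. We may further
assume that the x’s are given by functions of the ε’s, thus:

$$ \begin{aligned}
x_1 &= \phi_1(\epsilon_1, \epsilon_2, \epsilon_3, \ldots, \epsilon_m),\\
x_2 &= \phi_2(\epsilon_1, \epsilon_2, \epsilon_3, \ldots, \epsilon_m),\\
&\;\;\vdots\\
x_n &= \phi_n(\epsilon_1, \epsilon_2, \epsilon_3, \ldots, \epsilon_m).
\end{aligned} $$

All the ε’s need not occur in each x-function.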
* “Regression, Heredity and Panmixia,” Phil. Trans. A. Vol. cxcviui.
pp. 443-459. The following proof is based partly upon a lecture demonstration
given by Prof. Pearson.
We further assume that m > n, since otherwise the m ε’s
can be eliminated from these n equations.

Then our general problem is this: supposing the independent
variables, ε, each follow a Gaussian distribution, what will be
the frequency surface connecting the x’s?

Assume that the variations of ε are small relative to their
mean values. In this case we may expand the functions and
retain only the first powers, in other words we may make the
x’s linear functions of the ε’s.


Let ξ’s be deviations of x’s from mean values,
and η’s be deviations of ε’s from mean values.

Then

$$ \begin{aligned}
\xi_1 &= a_{11}\eta_1 + a_{12}\eta_2 + \ldots + a_{1m}\eta_m,\\
\xi_2 &= a_{21}\eta_1 + a_{22}\eta_2 + \ldots + a_{2m}\eta_m,\\
&\;\;\vdots\\
\xi_n &= a_{n1}\eta_1 + a_{n2}\eta_2 + \ldots + a_{nm}\eta_m.
\end{aligned} \qquad \text{(i)} $$

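By hypothesis, the η’s are independent and follow the
Gaussian distribution. Hence


the chance of a complex of variables lying between
$\eta_1$ and $\eta_1 + \delta\eta_1 = c_1 \times e^{-\frac{1}{2}\eta_1^2/s_1^2}\,d\eta_1$,
the chance of a complex of variables lying between
$\eta_2$ and $\eta_2 + \delta\eta_2 = c_2 \times e^{-\frac{1}{2}\eta_2^2/s_2^2}\,d\eta_2$,
the chance of a complex of variables lying between
$\eta_m$ and $\eta_m + \delta\eta_m = c_m \times e^{-\frac{1}{2}\eta_m^2/s_m^2}\,d\eta_m$.
Whole chance = product of these.
We can now eliminate one of the η’s in the following
manner.


Let us solve the n equations (i), finding the value of

$$ \eta_1,\ \eta_2,\ \ldots,\ \eta_n $$

in terms of

$$ \begin{aligned}
&\xi_1 - a_{1,n+1}\eta_{n+1} - a_{1,n+2}\eta_{n+2} - \ldots - a_{1m}\eta_m,\\
&\xi_2 - a_{2,n+1}\eta_{n+1} - a_{2,n+2}\eta_{n+2} - \ldots - a_{2m}\eta_m,\\
&\;\;\vdots\\
&\xi_n - a_{n,n+1}\eta_{n+1} - a_{n,n+2}\eta_{n+2} - \ldots - a_{nm}\eta_m.
\end{aligned} $$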
Since these equations are linear, our chance of a complex
of variables lying between

$$ \xi_1 \text{ and } \xi_1 + \delta\xi_1,\quad \xi_2 \text{ and } \xi_2 + \delta\xi_2,\ \ldots,\ \xi_n \text{ and } \xi_n + \delta\xi_n, $$
$$ \eta_{n+1} \text{ and } \eta_{n+1} + \delta\eta_{n+1},\ \ldots,\ \eta_m \text{ and } \eta_m + \delta\eta_m $$

will be of the form

$$ \text{Const.} \times e^{-(q_1 + q_2 + q_3)}\, d\xi_1\, d\xi_2 \ldots d\xi_n \times d\eta_{n+1} \ldots d\eta_m \qquad \text{(ii)}. $$

In this expression q₁ is a function of the ξ’s only, q₂ is a
function of η_{n+1} to η_m only, and q₃ is a function of the products
of ξ’s into η’s.

In order to find the chance that our complex of variables
lies between ξ₁, ξ₁ + δξ₁ up to ξ_n, ξ_n + δξ_n, we must integrate
expression (ii) for all values of η_{n+1} up to η_m between the
limits +∞ and −∞.


Integrating with regard to η_{n+1}, we find that the expression
keeps its general form, only η_{n+1} has disappeared. If now the
process be repeated with η_{n+2}, the integration will again leave
the expression of the same form except that η_{n+2} will have
disappeared. Continuing this for an indefinite number of
n integrations, all the η’s will at length disappear, and we
shall be left with a quadratic function of ξ’s only. In other
words, the chance of a complex of variables lying between

$$ \xi_1 \text{ and } \xi_1 + \delta\xi_1,\quad \xi_2 \text{ and } \xi_2 + \delta\xi_2,\ \ldots,\ \xi_n \text{ and } \xi_n + \delta\xi_n $$
$$ = \text{Const.} \times e^{-(g_{11}\xi_1^2 + g_{22}\xi_2^2 + \ldots + g_{nn}\xi_n^2 + 2g_{12}\xi_1\xi_2 + 2g_{13}\xi_1\xi_3 + \ldots)}\, d\xi_1\, d\xi_2 \ldots d\xi_n \qquad \text{(iii)}. $$


This expression represents the general form of the frequency
surface connecting the x’s.


It may be noted that the frequency will be constant when

$$ g_{11}\xi_1^2 + g_{22}\xi_2^2 + \ldots + g_{nn}\xi_n^2 + 2g_{12}\xi_1\xi_2 + 2g_{13}\xi_1\xi_3 + \ldots = \text{const.} \qquad \text{(iv)}. $$

This represents an ellipsoid in n-dimensional space.


Taking the simplest case of two variables, expression (iii)
becomes

$$ C\,e^{-(g_{11}\xi_1^2 + 2g_{12}\xi_1\xi_2 + g_{22}\xi_2^2)}\,d\xi_1\,d\xi_2 \qquad \text{(v)}. $$


Integrating (v) for all values of ξ₂ from −∞ to +∞, we
get the Gaussian curve of ξ₁ variation, so that

$$ \frac{1}{2\sigma_1^2} = \frac{g_{11}g_{22} - g_{12}^2}{g_{22}}. $$

Similarly, integrating for all values of ξ₁,

$$ \frac{1}{2\sigma_2^2} = \frac{g_{11}g_{22} - g_{12}^2}{g_{11}}. $$

If now we integrate for all values of ξ₁ and ξ₂ we get the
total frequency, N:

$$ N = \frac{C\pi}{\sqrt{g_{11}g_{22} - g_{12}^2}}. $$

Writing x for ξ₁, y for ξ₂, and r for $-\dfrac{g_{12}}{\sqrt{g_{11}g_{22}}}$, we have as the
equation to the correlation surface

$$ z = \frac{N}{2\pi\sigma_1\sigma_2\sqrt{1 - r^2}}\; e^{-\frac{1}{2(1 - r^2)}\left(\frac{x^2}{\sigma_1^2} - \frac{2rxy}{\sigma_1\sigma_2} + \frac{y^2}{\sigma_2^2}\right)} \qquad \text{(vi)}. $$

r in expression (vi) is the coefficient of correlation. That it is
a measure of correlation is apparent from the fact that with
no correlation the compound probability for two variables,
whose separate variations are represented by $e^{-x^2/2\sigma_1^2}$ and $e^{-y^2/2\sigma_2^2}$,
would be proportional to $e^{-\frac{1}{2}\left(\frac{x^2}{\sigma_1^2} + \frac{y^2}{\sigma_2^2}\right)}$. The term $\dfrac{2rxy}{\sigma_1\sigma_2}$ in the
index marks the difference between the two expressions.


It can be shown that S(xy), or $\int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty} z\,xy\,dx\,dy$, is equal
to $N r \sigma_1 \sigma_2$.
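
The argument of this chapter, that quantities built up linearly from independent Gaussian causes will be found correlated, can be tried numerically. The sketch below is an illustration under stated assumptions: two quantities are formed from three independent causes with hypothetical coefficients, the correlation expected from those coefficients is compared with the product-moment value found in a simulated sample, and the identity S(xy) = N r σ₁σ₂ is verified for the sample. All names and values are illustrative.

```python
import math
import random


def simulate(a, s, n_obs, seed=0):
    """Draw n_obs pairs (xi_1, xi_2), each a linear function of independent Gaussian eta's."""
    rng = random.Random(seed)
    xs, ys = [], []
    for _ in range(n_obs):
        eta = [rng.gauss(0.0, s_k) for s_k in s]
        xs.append(sum(c * e for c, e in zip(a[0], eta)))
        ys.append(sum(c * e for c, e in zip(a[1], eta)))
    return xs, ys


def expected_r(a, s):
    """Correlation implied by the coefficients a and the S.D.s s of the causes."""
    cov = sum(p * q * s_k ** 2 for p, q, s_k in zip(a[0], a[1], s))
    var_1 = sum(p * p * s_k ** 2 for p, s_k in zip(a[0], s))
    var_2 = sum(q * q * s_k ** 2 for q, s_k in zip(a[1], s))
    return cov / math.sqrt(var_1 * var_2)


def sample_moments(xs, ys):
    """Return S(xy), sigma_1, sigma_2 and r, all measured from the sample means."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    s_xy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sigma_1 = math.sqrt(sum((x - mx) ** 2 for x in xs) / n)
    sigma_2 = math.sqrt(sum((y - my) ** 2 for y in ys) / n)
    return s_xy, sigma_1, sigma_2, s_xy / (n * sigma_1 * sigma_2)


if __name__ == "__main__":
    A = [[1.0, 1.0, 0.0],    # hypothetical coefficients: the two quantities
         [0.0, 1.0, 1.0]]    # share only the second cause
    S = [1.0, 1.0, 1.0]
    xs, ys = simulate(A, S, 20000)
    s_xy, sigma_1, sigma_2, r = sample_moments(xs, ys)
    print("expected r:", expected_r(A, S), " sample r:", round(r, 3))
```

With one cause in common out of two acting on each quantity, the expected correlation is one-half, and the sample value should lie close to it.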


The correlation surface represented by equation (vi) is in
three dimensions of space*, and is such that vertical sections
parallel to the two axes of coordinates are all normal curves
and horizontal sections (i.e. the “contour lines”) are similar and
similarly situated ellipses. With respect to these ellipses the
two regression lines are conjugates to the axes of x and y
respectively.

Other properties of the surface are:


(1) The means of parallel arrays all lie on a straight line,
i.e. the regression is linear.


(2) The standard deviations of all parallel arrays are equal. 


* See Fig. 29 in G. Udny Yule’s Theory of Statistics, p. 166. 


It should be noted that although normal distribution of 
the two variates involves linear regression, the converse is not 
always true. For this reason the proof of the product-moment 
formula in Chap. I is more general than that given in the present 
chapter; it assumes linear regression, but is quite independent 
of the form of distribution. Similarly, the formulae for partial 
regression follow from the proof of the present chapter, but 
they can also be obtained more generally on the assumption 
merely of linear regression, independent of form of distribution 
(see Chap. I, pp. 59, 63). 

By means of the formulae for multiple or partial correlation 
we are able to deal with more than two variables, and to 
determine the relation between any two in independence* of 
the influence of all the rest. Hence in psychology, where the 
problem of analysis is one of the most important, the theory 
of partial correlation is of the utmost value. By its means 
the mental abilities of a homogeneous group of individuals can 
be analysed out and arranged in order of closeness of 
inter-relation to one another; in other words the abilities of the 
group may be exhibited as a system, quantitatively determined 
in its various parts. We must remember, however, that the 
formulae are only applicable in cases of linear regression. 
Evidence for this should therefore be forthcoming in all future 
researches aiming at completeness; r should be tested by η. 

* It is important to understand exactly in what sense this statement is 
correct. We do not eliminate the other variables when we employ the formula 
for partial correlation; we only make them constant, i.e. we eliminate the effect 
of their variation. 

Logicians remind us that Mill’s “Method of Concomitant 
Variations” is to be regarded not as a method of scientific proof 
but as a fruitful source of suggestions for scientific hypothesis. 
So it is with correlation. A correlation coefficient is a 
statement of probabilities. It does not prove anything. It merely 
suggests a hypothesis as regards causation within a particular 
sphere. The investigator must go beyond the coefficient to 
the facts themselves for his scientific hypothesis. In the light 
of this the coefficient gains new significance and, if carefully 
and accurately calculated, is now of absolute worth as a 
quantitative measure of a concomitance of variation explained 
by a reference to more ultimate factors. For reasons already 
set out in earlier chapters, a true correlation coefficient is not 
easy to obtain in psychology. Hence it is probable that much 
of the statistical work done by psychologists during the next 
few years will be limited to an effort to obtain more reliable 
values for their correlation coefficients. At the present time 
coefficients obtained by different people and even by the same 
people on different occasions and when working with different 
groups of subjects show a great lack of agreement with one 
another. So far as this is due to differences in the “irrelevant” 
conditions of the investigations a careful statement and analysis 
of these conditions and a comparison of those in one investigation 
with those in another are likely to explain the discrepancies 
and so turn them to good account. From the nature of the 
case, absolute agreement is not to be expected. If the 
discrepancies stimulate the investigator to go behind the coefficients 
to the facts themselves, the cause of science will be better 
served than if a closer agreement had been originally attained. 


It is well to bear constantly in mind that a correlation 
between two mental capacities does not always and necessarily 
indicate identity or partial identity of certain part-capacities ; 
it may be due to identity or partial identity of environment 
(taken in its widest sense). Special teaching may produce a 
correlation between two school-subjects which would not be 
found under ordinary conditions. Again, response to a common 
environment may have produced, in the course of evolution, a 
correlation between two mental functions introspectively quite 
dissimilar to one another. 


Partial correlation will carry us far in the unravelling of 
the tangled complex of mental ability. It will also serve to 
give quantitative precision to results when classified in the 
light of an explanatory hypothesis. For the production of the 
hypothesis itself we must look elsewhere, viz. to psychological 
analysis and psychological insight. 


APPENDIX I*


Fechner’s Fundamental Table.


  n      t = hD        n      t = hD        n      t = hD

 0.50    0.0000       0.67    0.3111       0.84    0.7031*
 0.51    0.0177       0.68    0.3307       0.85    0.7329
 0.52    0.0355       0.69    0.3506       0.86    0.7639
 0.53    0.0532       0.70    0.3708       0.87    0.7965
 0.54    0.0710       0.71    0.3913       0.88    0.8308
 0.55    0.0888*      0.72    0.4121       0.89    0.8673
 0.56    0.1067*      0.73    0.4333       0.90    0.9062
 0.57    0.1247       0.74    0.4549       0.91    0.9480*
 0.58    0.1427*      0.75    0.4769       0.92    0.9935*
 0.59    0.1609       0.76    0.4994       0.93    1.0435*
 0.60    0.1792*      0.77    0.5224       0.94    1.0993*
 0.61    0.1975       0.78    0.5460       0.95    1.1630*
 0.62    0.2160       0.79    0.5702       0.96    1.2380*
 0.63    0.2346*      0.80    0.5951       0.97    1.3300*
 0.64    0.2535       0.81    0.6208       0.98    1.4520*
 0.65    0.2725       0.82    0.6473       0.99    1.6450
 0.66    0.2916*      0.83    0.6747       1.00    ∞


If n is < 0.5, look in the Table not for n but for 1 - n, and 
take t negative. Thus the t for n = 0.25 is -0.4769. 
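
The entries of the Fundamental Table can be recomputed on the assumption that n and t are connected by the usual relation n = ½{1 + erf(t)}, i.e. that t√2 is the normal deviate cutting off a proportion n of the area. The Python sketch below rests on that assumption; the function name `fechner_t` is introduced here for illustration only.

```python
import math
from statistics import NormalDist


def fechner_t(n):
    """t = hD for a proportion n of right cases (0 < n < 1).

    Assumes n = 0.5 * (1 + erf(t)), so that t is the standard normal
    deviate for n divided by sqrt(2).
    """
    return NormalDist().inv_cdf(n) / math.sqrt(2.0)


# A few entries, including one below 0.5 to show the change of sign.
for n in (0.50, 0.75, 0.90, 0.25):
    print(f"n = {n:.2f}   t = {fechner_t(n):+.4f}")
```

Values of n below 0.5 come out negative of themselves, which is the rule of the note above.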


* The first two tables in this appendix are quoted from E. B. Titchener, 
Experimental Psychology, Vol. II. Student’s Manual, pp. 99 and 101. The 
“starred” values in the first table are the corrected values given by H. Bruns 
(Wahrscheinlichkeitsrechnung und Kollektivmasslehre, 1906), and quoted by 
F. M. Urban (Psychological Review, Vol. XVII. 1910, p. 251). Sheppard’s Tables 
of the Probability Integral (Biometrika, Vol. II.) are more generally useful than 
this one of Fechner’s. 


Müller’s Table of Coefficients of Weights. 


  n      w''          n      w''          n      w''

 0.50    1.000       0.67    0.824       0.84    0.373
 0.51    0.999       0.68    0.803       0.85    0.342
 0.52    0.997       0.69    0.782       0.86    0.311
 0.53    0.994       0.70    0.760       0.87    0.281
 0.54    0.990       0.71    0.737       0.88    0.251
 0.55    0.984       0.72    0.712       0.89    0.222
 0.56    0.977       0.73    0.687       0.90    0.193
 0.57    0.969       0.74    0.661       0.91    0.166
 0.58    0.960       0.75    0.634       0.92    0.139
 0.59    0.950       0.76    0.606       0.93    0.114
 0.60    0.938       0.77    0.578       0.94    0.089
 0.61    0.925       0.78    0.550       0.95    0.067
 0.62    0.911       0.79    0.521       0.96    0.047
 0.63    0.896       0.80    0.492       0.97    0.029
 0.64    0.880       0.81    0.463       0.98    0.014
 0.65    0.862       0.82    0.433       0.99    0.004
 0.66    0.843       0.83    0.403


The weight of an n which is < 0.5 is the same as the weight 
of an n which exceeds 0.5 by the same amount. Thus the 
weights of n = 0.25 and of n = 0.75 are both alike = 0.634. 
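
The tabulated coefficients appear to agree, to the three places given, with exp(-2t²), where t is the Fundamental Table value for the same n. That relation is an assumption of the following Python sketch and is not stated in the text; the names `fechner_t` and `muller_weight` are likewise introduced for illustration only.

```python
import math
from statistics import NormalDist


def fechner_t(n):
    """t for a proportion n, assuming n = 0.5 * (1 + erf(t))."""
    return NormalDist().inv_cdf(n) / math.sqrt(2.0)


def muller_weight(n):
    """Coefficient of weight for n, assumed here to be exp(-2 t^2)."""
    t = fechner_t(n)
    return math.exp(-2.0 * t * t)


# Weights depend on t^2 only, so n and 1 - n receive the same weight.
for n in (0.50, 0.75, 0.25, 0.90):
    print(f"n = {n:.2f}   weight = {muller_weight(n):.3f}")
```

Because the weight depends on t² alone, the rule that n = 0.25 and n = 0.75 have equal weight follows at once.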


Improved Table of Weights*. 


  n      W            n      W            n      W

 0.50    1.000       0.67    0.932       0.84    0.694
 0.51    1.000       0.68    0.923       0.85    0.670
 0.52    0.999       0.69    0.914       0.86    0.646
 0.53    0.998       0.70    0.904       0.87    0.621
 0.54    0.996       0.71    0.894       0.88    0.595
 0.55    0.995       0.72    0.883       0.89    0.567
 0.56    0.992       0.73    0.871       0.90    0.538
 0.57    0.989       0.74    0.859       0.91    0.506
 0.58    0.985       0.75    0.846       0.92    0.472
 0.59    0.981       0.76    0.832       0.93    0.435
 0.60    0.977       0.77    0.818       0.94    0.396
 0.61    0.972       0.78    0.803       0.95    0.352
 0.62    0.967       0.79    0.787       0.96    0.304
 0.63    0.960       0.80    0.770       0.97    0.249
 0.64    0.954       0.81    0.752       0.98    0.187
 0.65    0.947       0.82    0.733       0.99    0.112
 0.66    0.940       0.83    0.713       1.00    0.000


* The values in this table are given by the formula 
PRE ted 
~ Oe" n(L—n)’ 
where N=no. of experiments made. 
The table is quoted from F. M. Urban: “The Method of Constant Stimuli 
and its Generalizations,” Psychological Review, Vol. XVII. 1910, p. 253. See 
also “Die psychophysischen Massmethoden als Grundlagen empirischer 
Messungen,” by the same author, Archiv f. d. ges. Psychologie, Vols. XV. 
and XVI. 
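
The relative weights of the Improved Table appear to be reproduced, to within about a unit in the third place, by exp(-2t²) / {4n(1 - n)}, where t is the Fundamental Table value for n. This normalised form, in which W = 1 at n = 0.5 and the number of experiments N drops out, is an assumption of the Python sketch below and is not quoted from Urban; the function names are hypothetical.

```python
import math
from statistics import NormalDist


def fechner_t(n):
    """t for a proportion n, assuming n = 0.5 * (1 + erf(t))."""
    return NormalDist().inv_cdf(n) / math.sqrt(2.0)


def improved_weight(n):
    """Relative weight, assumed to be exp(-2 t^2) / (4 n (1 - n))."""
    t = fechner_t(n)
    return math.exp(-2.0 * t * t) / (4.0 * n * (1.0 - n))


for n in (0.50, 0.60, 0.75, 0.90):
    print(f"n = {n:.2f}   W = {improved_weight(n):.3f}")
```

The divisor n(1 - n) is what raises these weights above Müller’s for proportions far from 0.5.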
      Speed of addition (x)                  Accuracy of addition (y)

   f      x'      fx'      fx'^2          f      y'      fy'      fy'^2

  3.5    -3     -10.5     31.5            6     -5.5    -33      181.5
 18.5    -2     -37       74              4     -3      -12       36
 14.5    -1     -14.5     14.5            3     -2       -6       12
 18       0       0        0              9     -1       -9        9
 15.5     1      15.5     15.5           18      0        0        0
 10.5     2      21       42             25      1       25       25
  2.5     3       7.5     22.5           21      2       42       84
  3       4      12       48

 86 = N        -62 + 56   248            86 = N        -60 + 67   347.5
                = -6                                    = +7

$$d_1 = -\frac{6}{86} = -0.06977, \qquad d_2 = \frac{7}{86} = 0.0814,$$
$$\sigma_1^2 = \frac{248}{86} - d_1^2 - \frac{1}{12}\,{}^{*} = 2.7955, \qquad \sigma_2^2 = \frac{347.5}{86} - d_2^2\,{}^{\dagger} = 4.0341,$$
$$\sigma_1 = 1.672, \qquad \sigma_2 = 2.008.$$


* Sheppard’s correction. Remember that $\sigma^2 = \mu_2$, and $d = \nu'_1$; see above p. 39. 

† Sheppard’s correction cannot be used here, since the units of the 
sub-groups are not equal and there is not high contact at the ends of the 
frequencies. 
 Frequencies*                              x'y'     Total of frequencies (f)

 1 + 5.5 - 4.5 - 1.5                        1          0.5
 2.5 + 0.5 + 2.5 + 3 - 4.5 - 5 - 2          2         -3
 1 + 0.5 + 0.5 - 1                          3          1
 2 + 2.5 + 0.5 - 2.5                        4          2.5
 0.5 - 1                                    5.5       -0.5
 1 + 2 - 1                                  6          2
 0.5                                        8          0.5
 1                                          9          1
 3                                         11          3
 -1                                        22         -1

 S (x'y') = 71.5 - 30.75 = 40.75


$$S(xy) = S(x'y') - Nd_1d_2\,{}^{\dagger} = 40.75 + 86 \times 0.0057 = 41.24,$$
$$r = \frac{S(xy)}{N\sigma_1\sigma_2} = \frac{41.24}{86 \times 1.672 \times 2.008} = 0.143,$$
$$\text{P.E.}_r = 0.67449\,\frac{1 - r^2}{\sqrt{N}} = 0.071.$$
$$\therefore\ r_{\text{speed of addition}\,\cdot\,\text{acc. of addition}} = 0.14 \pm 0.07.$$


* The figures in italics in the correlation table. 

† $S(x'y') = S(x + d_1)(y + d_2) = S(xy) + d_1S(y) + d_2S(x) + Nd_1d_2 = S(xy) + Nd_1d_2$, since $S(x) = S(y) = 0$. 
$\therefore\ S(xy) = S(x'y') - Nd_1d_2$. 
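
The arithmetic of this page can be retraced in a few lines of Python. The sketch below is illustrative only: the variable names are hypothetical, the products and frequencies are those of the table as read here, d₁d₂ = -0.0057 is taken from the term 86 × .0057, and the standard deviations are taken as the square roots of the variances 2.796 and 4.034 used elsewhere in this appendix.

```python
import math

# (x'y' product, total frequency f), as read from the table above.
rows = [(1, 0.5), (2, -3), (3, 1), (4, 2.5), (5.5, -0.5),
        (6, 2), (8, 0.5), (9, 1), (11, 3), (22, -1)]

N = 86
D1_D2 = -0.0057                 # d1 * d2, from the term "86 x .0057"
SIGMA_1 = math.sqrt(2.796)      # speed of addition
SIGMA_2 = math.sqrt(4.034)      # accuracy of addition

s_raw = sum(product * f for product, f in rows)    # S(x'y')
s_xy = s_raw - N * D1_D2                           # S(xy), about the means
r = s_xy / (N * SIGMA_1 * SIGMA_2)                 # product-moment coefficient
pe_r = 0.67449 * (1.0 - r * r) / math.sqrt(N)      # probable error of r

print(s_raw, round(s_xy, 2), round(r, 3), round(pe_r, 3))
```

Small differences in the third decimal place are to be expected, since the text works with rounded intermediate values.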
 $\bar{y}_x - \bar{y}$        $(\bar{y}_x - \bar{y})^2 \times n_x$

  -0.224       0.050176 x  3.5  =  0.175616
  -1.081       1.168561 x 18.5  = 21.618379
  +0.488       0.238144 x 14.5  =  3.453088
  +0.322       0.103684 x 18    =  1.866312
  +0.145       0.021025 x 15.5  =  0.325888
  +0.490       0.2401   x 10.5  =  2.52105
  +1.719       2.954961 x  2.5  =  7.387403
  -1.414       1.999396 x  3    =  5.998188

               $S\{n_x(\bar{y}_x - \bar{y})^2\}$ = 43.345924


$$\eta^2 = \frac{S\{n_x(\bar{y}_x - \bar{y})^2\}}{N\sigma_2^2} = \frac{43.345924}{86 \times 4.034} = 0.1249,$$
$$\eta = 0.353.$$
$$\text{P.E.}_\eta = 0.67449\,\frac{1 - \eta^2}{\sqrt{N}} = 0.064,$$
$$\therefore\ \eta_{\text{speed of addition}\,\cdot\,\text{acc. of addition}} = 0.35 \pm 0.06.$$
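
The correlation ratio of this page may be recomputed as follows. The Python sketch is illustrative only; the variable names are hypothetical, and the deviations, array frequencies and the variance 4.034 are those printed above.

```python
import math

# (deviation of array mean from general mean, array frequency n_x)
arrays = [(-0.224, 3.5), (-1.081, 18.5), (0.488, 14.5), (0.322, 18),
          (0.145, 15.5), (0.490, 10.5), (1.719, 2.5), (-1.414, 3)]

VARIANCE_Y = 4.034                        # sigma_2 squared, from the text

n_total = sum(n for _, n in arrays)       # N
s_between = sum(n * d * d for d, n in arrays)
eta_sq = s_between / (n_total * VARIANCE_Y)
eta = math.sqrt(eta_sq)
pe_eta = 0.67449 * (1.0 - eta_sq) / math.sqrt(n_total)

print(n_total, round(s_between, 6), round(eta_sq, 4),
      round(eta, 3), round(pe_eta, 3))
```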
[Fig. 16: speed plotted against accuracy (% of right answers); mean speed 237.7, mean accuracy 164.2.]

Calculating η from the means of the y-arrays, we have 
$$\eta^2 = \frac{S\{n_y(\bar{x}_y - \bar{x})^2\}}{N\sigma_1^2} = \frac{19.68101}{86 \times 2.796} = 0.0819,$$
$$\therefore\ \eta = 0.29 \pm 0.07.$$

To test the value of η obtained from the means of the 
x-arrays, for linear regression*: 
$$\frac{\sqrt{N}}{0.67449}\cdot\tfrac{1}{2}\sqrt{\eta^2 - r^2} = 2.22,\ \text{i.e.} < 2.5.$$

Hence regression may be considered to be linear. 


Regression coefficients: 
$$b_{12} = r\,\frac{\sigma_1}{\sigma_2} = 0.118, \qquad b_{21} = r\,\frac{\sigma_2}{\sigma_1} = 0.172.$$


* See p. 59. 
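
The test for linearity used above can be written as a small function. The following Python sketch is illustrative; the function name is hypothetical, and the inputs N = 86, η² = 0.1249 and r = 0.143 are the values of the worked example.

```python
import math


def linearity_criterion(n_total, eta, r):
    """(sqrt(N) / 0.67449) * (1/2) * sqrt(eta^2 - r^2).

    Values below 2.5 are taken in the text to indicate that the
    regression may be considered linear.
    """
    return math.sqrt(n_total) / 0.67449 * 0.5 * math.sqrt(eta * eta - r * r)


value = linearity_criterion(86, math.sqrt(0.1249), 0.143)
print(round(value, 2), value < 2.5)
```

The criterion grows with the excess of η² over r² and with the size of the sample, so that a given excess counts for more in a large group than in a small one.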
Equations to regression lines are 
$$x - \bar{x} = b_{12}\,(y - \bar{y}),$$
and 
$$y - \bar{y} = b_{21}\,(x - \bar{x}).$$
∴ equation to regression line AB is 
$$y - 164.2 = 0.172\,(x - 237.7),$$
i.e. 
$$y = 0.172\,x + 123.316 \qquad \text{(i)}.$$
Similarly equation to line CD is 
$$x - 237.7 = 0.118\,(y - 164.2),$$
i.e. 
$$x = 0.118\,y + 218.324 \qquad \text{(ii)}.$$


Equation (i) gives the most probable value of y associated 
with a given value of x, with a standard error 
$\sigma_2\sqrt{1 - r^2}$, i.e. 1.991. 


Similarly, mutatis mutandis, with equation (ii). 
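
The passage from the mean-centred form of a regression line to its slope-intercept form may be sketched thus. The Python below is illustrative only; the function and variable names are hypothetical, and the slopes 0.172 and 0.118 and the means 237.7 and 164.2 are those printed above.

```python
def intercept(mean_dependent, slope, mean_independent):
    """Constant term of the line through the two means with given slope."""
    return mean_dependent - slope * mean_independent


MEAN_X, MEAN_Y = 237.7, 164.2      # mean speed, mean accuracy
B_YX, B_XY = 0.172, 0.118          # slopes of lines AB and CD

a_y = intercept(MEAN_Y, B_YX, MEAN_X)    # constant of equation (i)
a_x = intercept(MEAN_X, B_XY, MEAN_Y)    # constant of equation (ii)


def predict_y(x):
    """Most probable y for a given x, by equation (i)."""
    return B_YX * x + a_y


print(round(a_y, 3), round(a_x, 3), round(predict_y(MEAN_X), 1))
```

Each line passes through the point of the two means, so that the mean speed leads back to the mean accuracy.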
Example of Multiple Correlation. 
GROUP I. (Boys, ages 11-12; n = 66.) 


1. Crossing through two letters (e and r). 
2. Crossing through four letters (a, n, o, s). 
3. Combination test. 
4. Mechanical memory test. 


Formula for multiple correlation: 
$$r_{12.34\ldots n} = \frac{r_{12.34\ldots(n-1)} - r_{1n.34\ldots(n-1)}\; r_{2n.34\ldots(n-1)}}{\left(1 - r^2_{1n.34\ldots(n-1)}\right)^{\frac{1}{2}}\left(1 - r^2_{2n.34\ldots(n-1)}\right)^{\frac{1}{2}}}.$$


For four variables this becomes: 
$$r_{12.34} = \frac{r_{12.3} - r_{14.3}\,r_{24.3}}{\left(1 - r_{14.3}^2\right)^{\frac{1}{2}}\left(1 - r_{24.3}^2\right)^{\frac{1}{2}}},$$
$$r_{12.3} = \frac{r_{12} - r_{13}\,r_{23}}{\left(1 - r_{13}^2\right)^{\frac{1}{2}}\left(1 - r_{23}^2\right)^{\frac{1}{2}}}.$$


TABLE I. 

        Correlation coefficient     log (1 - r^2)

 12            0.78                   1̄.59284
 13            0.45                   1̄.90173
 14            0.40                   1̄.92428
 23            0.48                   1̄.88627
 24            0.29                   1̄.96185
 34            0.52                   1̄.86308
TABLE II. 

        Correlation     Product                             Correlation
        coefficient     term of      Numerator              coefficient     log (1 - r^2)
        (zero order)    numerator                           (first order)

 12        0.78          0.2160       0.5640       12.3       0.7199          1̄.68281
 13        0.45          0.3744       0.0756       13.2       0.1377          1̄.99169
 23        0.48          0.3510       0.1290       23.1       0.2308          1̄.97623

 12        0.78          0.1160       0.6640       12.4       0.7570          1̄.63038
 14        0.40          0.2262       0.1738       14.2       0.2902          1̄.96179
 24        0.29          0.3120      -0.0220       24.1      -0.0386          1̄.99348

 13        0.45          0.2080       0.2420       13.4       0.3091          1̄.95639
 14        0.40          0.2340       0.1660       14.3       0.2176          1̄.97893
 34        0.52          0.1800       0.3400       34.1       0.4154          1̄.91774

 23        0.48          0.1508       0.3292       23.4       0.4027          1̄.92316
 24        0.29          0.2496       0.0404       24.3       0.0539          1̄.99873
 34        0.52          0.1392       0.3808       34.2       0.4536          1̄.89996


TABLE III. 

        Correlation     Product                             Correlation
        coefficient     term of      Numerator              coefficient     log (1 - r^2)
        (first order)   numerator                           (second order)

 12.4      0.7570        0.1245       0.6325       12.34      0.727           1̄.67345
 13.4      0.3091        0.3048       0.0043       13.24      0.007           1̄.99998
 23.4      0.4027        0.2340       0.1687       23.14      0.272           1̄.96662

 12.3      0.7199        0.0117       0.7082       12.34      0.727              -
 14.3      0.2176        0.0388       0.1788       14.23      0.258           1̄.97009
 24.3      0.0539        0.1567      -0.1028       24.13     -0.152           1̄.98985

 13.2      0.1377        0.1316       0.0061       13.24      0.007              -
 14.2      0.2902        0.0625       0.2277       14.23      0.258              -
 34.2      0.4536        0.0400       0.4136       34.12      0.436           1̄.90843

 23.1      0.2308       -0.0160       0.2468       23.14      0.272              -
 24.1     -0.0386        0.0959      -0.1345       24.13     -0.152              -
 34.1      0.4154       -0.0089       0.4243       34.12      0.436              -
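
The step-by-step reduction carried out in Tables II and III can be sketched as a recursion on the formula for multiple correlation. The Python below is illustrative only: the function name `partial_r` and the dictionary layout are hypothetical, and the six zero-order coefficients are those of Table I.

```python
import math

# Zero-order coefficients of Table I, keyed by the pair of variables.
ZERO_ORDER = {
    frozenset((1, 2)): 0.78, frozenset((1, 3)): 0.45,
    frozenset((1, 4)): 0.40, frozenset((2, 3)): 0.48,
    frozenset((2, 4)): 0.29, frozenset((3, 4)): 0.52,
}


def partial_r(i, j, given=()):
    """Correlation of i and j with the variables in `given` made constant.

    Removes the last variable of `given` by the formula of the text and
    recurses on the remainder; with nothing given, returns r_ij itself.
    """
    if not given:
        return ZERO_ORDER[frozenset((i, j))]
    *rest, k = given
    rest = tuple(rest)
    r_ij = partial_r(i, j, rest)
    r_ik = partial_r(i, k, rest)
    r_jk = partial_r(j, k, rest)
    return (r_ij - r_ik * r_jk) / math.sqrt((1 - r_ik ** 2) * (1 - r_jk ** 2))


print(round(partial_r(1, 2, (3,)), 4))      # first order, 12.3
print(round(partial_r(1, 2, (3, 4)), 3))    # second order, 12.34
print(round(partial_r(3, 4, (1, 2)), 3))    # second order, 34.12
```

The order in which the other variables are removed does not affect the result, which is why each second-order coefficient appears twice in Table III with the same value.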


The regression equation between changes in intelligence 
(as measured by the combination test) and changes in the other 
three variables is 
$$x_3 = b_{31.24}\,x_1 + b_{32.14}\,x_2 + b_{34.12}\,x_4.$$
Calculation of regression coefficients. 
G—63 ld ¢@—9043, %,— 106s (or 10) 
Oy 03 eelOy (beg = Pig) (bea) ee Cl Se 
= Gs,0= F9(1 1)? L ~ Pea)? 
ca a a 


Shuilarly Gj) 4)= 40°83, 63,013 '21, G1 — 66 21, oe 1, 
CG, 9 =O Oo. 


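Again, 
$$b_{12.34\ldots n} = r_{12.34\ldots n}\,\frac{\sigma_{1.34\ldots n}}{\sigma_{2.34\ldots n}},$$
$$\therefore\ b_{31.24} = r_{31.24}\,\frac{\sigma_{3.24}}{\sigma_{1.24}} = 0.002.$$

Similarly, $b_{32.14} = 0.099$, $b_{34.12} = 0.703$. 

Hence regression equation is 
$$x_3 = 0.002\,x_1 + 0.099\,x_2 + 0.703\,x_4.$$


The standard error ($\sigma_{3.124}$) made in estimating $x_3$ from $x_1$, $x_2$, 
and $x_4$ by this equation 
$$= \sigma_3\left(1 - r_{13}^2\right)^{\frac{1}{2}}\left(1 - r_{23.1}^2\right)^{\frac{1}{2}}\left(1 - r_{34.12}^2\right)^{\frac{1}{2}} = 12.78.$$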


APPENDIX IV 


Regression Curves for Speed and Accuracy of Addition. 


GROUP II. 

[Figure: regression curve; axes speed and accuracy (acc.).]

$\eta = 0.524 \pm 0.079$, 
$r = 0.24 \pm 0.101$, 
$$\frac{\sqrt{N}}{0.67449}\cdot\tfrac{1}{2}\sqrt{\eta^2 - r^2} = 2.16.$$


GROUP III. 

[Figure: regression curve; axes speed and accuracy (acc.).]

$\eta = 0.38 \pm 0.09$, 
$r = 0.33 \pm 0.09$. 
GROUP IV. 

[Figure: regression curve; axes speed and accuracy (acc.).]

$\eta = 0.60 \pm 0.06$, 
$r = 0.43 \pm 0.08$, 
$$\frac{\sqrt{N}}{0.67449}\cdot\tfrac{1}{2}\sqrt{\eta^2 - r^2} = 2.28.$$


GROUP V (a) (enlarged, n = 51). 

[Figure: regression curve; axes speed and accuracy (acc.).]

$\eta = 0.62 \pm 0.06$, 
$r = 0.38 \pm 0.10$, 
$$\frac{\sqrt{N}}{0.67449}\cdot\tfrac{1}{2}\sqrt{\eta^2 - r^2} = 2.26.$$
GROUP VI (additional group, 40 boys, ages 11-12). 

[Figure: regression curve; axes speed and accuracy (acc.).]

$\eta = 0.38 \pm 0.09$, 
$r = 0.09 \pm 0.10$. 


Regression Curve for Combination and Mechanical Memory. 
GROUP I. 

[Figure: regression curve; axes combination and mechanical memory (mech. memory).]

$\eta = 0.63 \pm 0.05$, 
$r = 0.52 \pm 0.06$. 
SUBJECT INDEX 


“Absolute impression,” error due to, 34 
Addition test, 103 
Adjustments, Sheppard’s, 38, 137 
Amalgamated results, formula for reliability (r) of, 101, 102 
Analograph, 93, 94 
“ANOS” test, 102 
Association time, 111 
Asymmetrical (or “point”) binomial, 37 
Average deviation (A.D.), 26 

Binomial frequency distribution, 27 
Biometric notation, 26, 36 ff. 
“Bisection of lines” test, 104, 105 

“Central factor,” 87, 89, 90, 92, 120, 127 
Centroid, see Mean 
Combination test, 109, 110, 126 
Concomitant variation, 42, 132 
Constant errors, 15, 17 
Contingency method, 66, 67 
“Contrast,” error due to, 34 
Correlation, 12, 42 ff.; coefficient (r), 42, 43, 46, 55-57; 
    its significance, 78-80, 128 ff.; ratio, 42, 43, 47, 57-59; 
    table, example of, 136 
Correspondence of results obtained by the three 
    psycho-physical methods, 32 ff. 
Crude constant error, 17 

Diagram, see “Scatter-diagram” 
“Difference-hypothesis” of Fechner, 6 
Discipline, effect upon correlations, 113 
Discrimination, 10, 34; tests for measuring sensory 
    discrimination, 87 ff. 
Drawing, relation to other mental abilities, 96, 126, 127 

Environment producing correlation, 12, 120, 133 
“ER” test, 101 
Errors of observation, 43, 83 ff. 
Euler’s circles, 52 
Expectation, error due to, 16 

Fatigue, error due to, 15, 16, 34 
“Foot-rule” formula for correlation (Spearman), 71 ff. 
“Fore-knowledge” of subject, 34 
Four-fold Table, 65, 66 
Frequency-distributions, 20, 28, 35 ff. 
Fundamental formula for mental measurement, 4 
Fundamental Table, 29, 134 

Gaussian curve or frequency-distribution, 12, 26-28, 37, 39, 
    60, 68, 72, 129 ff. 
Grade, 11, 72 

Habituation, error due to, 16 
Heteroclinal, 94 
Heterogeneity of material producing spurious correlation, 
    77, 113, 120 
Hierarchical order of correlation coefficients, 87, 92, 113 
“High contact,” 36, 39 
Homoclinal, 94 
Hypergeometrical series, 37 
Hypothesis, use of correlation in the suggestion of, 132, 133 

Index correlation, 77 
Inheritance of psychical characters, 95 

Intensity of sensation not measurable, 
Lie 

Interpolation, 25 ff., 36 ff. 
Introspection, importance of, 11, 14, 34 
Inversion, of the first order, 19; of the second order, 20 
“Irrelevant” factors, 43, 53, 82 

Just-noticeable differences of sensation-intensity, 3 ff. 


Limen, difference-, 3, 30; stimulus-, 10; importance in 
    Psychology, 10; determinations of, 13 ff. 
Linear regression, 43, 44, 46, 47, 63 

Mean, 21, 24, 39, 40 
Mean variable error (m.v.), 18 
Mean variation (m.v.), 14, 17, 18, 26 
Measure of precision (h), 26 
Measurement, mental, 1 ff., 35 
Mechanical memory test, 108 
Median, 21, 22 
Memory, for poetry, 109; correlates with intelligence, 126 
Mental tests, 11, 17, 81 ff., 99 ff. 
Method, of Limits, 13 ff., 33, 34; of Average Error, 17, 18; 
    of Constant Stimuli, 18 ff.; of Serial Groups, 16, 17; 
    of Concomitant Variations, 42, 132; of Least Squares, 30, 55 
Mode, 21, 22, 39 
Moment-coefficient or moment, 37 ff. 
“Motor” test, 102, 103 
Müller-Lyer Illusion, 105, 106 
Multiple correlation, 51-53, 63 ff., 128 ff. 

“Normal” correlation, 128 ff. 
“Normal” frequency-distribution, see Gaussian 

Observational errors, elimination of, 83 ff. 

“Partial” or multiple correlation, 51-53, 63 ff., 121, 124; 
    probable error of, 65 
Poetry, memory for, 109 
“Point” binomial, 37 
Practice, error due to inequalities of, 34 
Precision, measure of (h), 26 
Probability curve, 26 ff. 
Probable error (P.E.), 14, 22, 27, 49, 59 ff., 73 
Product-moment formula, 46 


“Quotient-hypothesis” of Plateau, 6-8 

Random sample, 20, 35, 59 
Ranks, correlation of, 11, 48, 49, 50, 71 
Ratio, see Correlation ratio 
Regression, 46; line, 47, 56, 57; curve, 55; examples of, 
    145-147; coefficients, 56, 57, 63; criterion for 
    linearity of, 59 
Reliability, measures of, 14, 23; coefficient of, 83 

“Scatter,” measures of, 14, 22, 23, 26 
“Scatter-diagram,” 54 
Sensation, see Intensity 
Sense-distance, 1, 2, 3, 5 
Sexual mental differences, 111, 121, 126, 127 
“Side-comparisons,” error due to, 34 
Skewness of distribution, 22, 39 
Space error, 15-17 
Spurious correlation, 76, 77, 113, 120 
Standard deviation (σ), 22, 26, 27, 39 
Stimulus, measurement of as an aid to mental measurement, 2, 11 

Threshold, see Limen 
Time error, 15-17 
“Trisection of lines” test, 104, 105 
Two-row Table, 67 ff. 

Unit of measurement, 1; unit sense-distance, 2; j. n. d. as unit, 4 

Variability, 23, 44, 84, 86; coefficient, 86, 125 
Variation, coefficient of, 61 
Vertical-Horizontal Illusion, 106, 107, 126 

Weber’s Law, 3, 4; interpretations of, 8-10 
Weights, Table of Coefficients of, 30, 135 
Weldon’s Experiment, 78, 79 